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METHODS AND COMPOUNDS FOR MODULATING 
MELANOCORTIN RECEPTOR LIGAND BINDING AND ACTIVITY 

CROSS-REFERENCE l O RELATED APPLICATIONS 

This application is a conlinuation-in-part of PCT application no: 
5 PCT/US99/25201, filed on October 27, 1999, designating the United States of America, and 
also claims benefit of and priority to U S. provisional application no: 60/203,07 1 , filed on 
May 9, 2000, and U.S. provisional application no: 60/226,047, filed on August 16, 2000, all 
of which are incorporated herein by reference, in their entirety, for all purposes. 

STATEMENT AS TO RIGHTS TO INVENTIONS MADE UNDER FEDERALLY 
1 0 SPONSORED RESEARCH AND DEVELOPMENT 

This invention was made with Government support under Grant Nos. 
GM46870. and DK58606 awarded by the National Institutes of Health. The Government of 
the United States of America may have certain rights in this invention. 

FIELD OF THE INVENTION 

15 I he present invention relates to methods and compounds for modulating 

melanocortin receptor- ligand binding. 

BACKGROUND OF THE INVENTION 

The melanocortin (MC) receptors are a group of cell surface proteins that 
mediate a variety of physiological effects, including: regulation of adrenal gland function, 

20 such as production of glucocorticoid, Cortisol and aldosterone; control of melanocyte 

growth and pigment production; hemoregulation: immunomodulation; and analgesia. Five 
distinct MC receptors have been cloned and arc expressed in a variety of tissues, including 
melanocytes, adrenal corte.x, brain, gut, placenta, skeletal muscle, lung, spleen, thymus, 
bone marrow, pituitary, gonads and adipose tissue. Two MC cceplors. MC3R and MC4R. 

25 are expressed in brain tissue, while MClr is found in the skin. A variety of ligands termed 
melanocoitins function as agonists that stimulate the activity of MC receptors. The 
mclanocortins include melanocyle-stimulating hormones (MSH) such as alph-MSH. bela- 
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MSH and gamma-MSH, as well as adrcnocorticolropic hormone. Another MCR ligand that 
has been discovered is the Agouti Related Protein (AGRP), which is an antagonist. 

AGRP is a natural antagonist of melanocortin receptors 3 and 4 (MC3R and 
MC4R). Recent biochemical investigations have identified AGRP as playing a major role 
5 in the regulation of m.am.malian feeding behavior. The human AGRP is a 132 residue 
polypeptide (SEQ ID NO: 1) that is a naturally occurring competitive antagonist of 
melanocortin receptors 3 and 4 ("MC3r" and "MC4r"), the ovcrcxpression of which results 
in adult onset obesity and diabetes in mice (Shutter et ai (1997) Genes Dev. 1 1:593-602; 
Huszar etal. (1997 ) Cell 88:131-141; Hahn,^^/a/. (1997) Nature Neurosci. 1:271-171). 
]0 AGRP binding to MC4r in particular is the subject of intense interest since 

knockout mrice that do not express MC4r exhibit the same phenotype as caused by 
overexpression of AGRP (Huszar, et ai. supra). There is also evidence for the parallel 
expression of AGRP and neuropeptide Y in the arcuate nucleus of the hypothalamus, with 
neuropeptide Y known to stimulate feeding (Hahn, et ai, supra). This region of the brain 
15 also expresses MC4r and is involved in energy homeostasis. Research has also focused on 
other melanocortin receptors, their antagonists and methods for modulating receptor 
activity. See for example, Wei et ai, WO9943709. 

The growing body of evidence linking AGRP to weight control has yet to 
elucidate its exact mechanism of action. However, studies on AGRP do benefit from 
20 analogy to the much more widely studied agouti protein, as AGRP was originally identified 
through the homology of its C-terminal region with the same region of the agouti protein 
(Shutter et ai, supra). The agouti protein has been a focal point in obesity research for a 
number of years, since ectopic expression of the wild-type protein in mice leads to obesity 
and related disorders, i.e. the same symptoms as the overexpression of the more recently 
25 identified AGRP (Klebig, et ai, (1995) Proc. Natl. Aead. Sci. USA 92:4728-4732; 

Michaud, et al. (1997) J. Endocrinol 155:207-209. However, unlike AGRP, agouti has 
distinct expression patterns in mice and humans, making in vivo work with mice less 
applicable to human obesity disorders. AGRP, like agouti, is selective for MC3r and MC4r 
but has approximately 100-fold greater binding affinity than agouti at these receptors (Fong, 
30 et al. (1997) Biochem. Res. Commun. 237:629 611). 

While full length agouti and AGRP are only 25CJ homologous, in their 46 
residue Cys-rich C-terminal regions, nine of the 10 Cys residues are spatially conserved and 
there are a funher 10 identical residues giving -40Vr sequence identity. Three consecutive. 
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conserved residues RFF ( 1 1 1-1 13 in human AGRP) were determined lo be essenlial to the 
biological activity ot both agouti (Kicter. el al. (1997) Bioclicmistry 36;2()84-2()9(): Kielcr, 
ct al. (1998) Biocheimstry 37: 991-997) and AGRP (Tota et al. (1999) Bioclwmi.slrx 38:897- 
904). Two recent investigations have shown that the chemically synthesized C-tcrminal 
S rcaion of AGRP competitively antagonizes f;t-melanocyte stimulating hormone (a-MSH) at 
melanocortin receptors with equal or greater potency than the full proteins (Qui 1 Ian, ct al. 
(1998) FEBS Leu. 428:59-62; Yang, et al. (1999) Mol. Endocrinol. 13: 148-155), consistent 
with similar findings for agouti (Willard, ct al. (1995) Biochemistry 34:12341-12346). 
Thus the Cys-rich C-terminal region of AGRP, is referred to as minimized agouti related 
10 protein ("MARP", CVRLH5ESCLG,oQQVPC,5CDPCA2oTCYCR25FFNA F30CYCR 
K',5LGTAM4uNPCSR4sT, SEQ ID NO:2, where subscripts denote amino acid position). 

The covalent structure of MARP exhibits five disulfide bonds, which exist 
between the following ten Cys residues (Bures, et al. (1988) Biochemistry 37:12172- 
12177): Cys, and Cysu,; Cysg and Cys22; Cysi5 and Cys,,; Cysi9 and Cys43; Cys24 and Cyssi. 
15 Despite the important biological activities of AGRP, no experimental 3D 

structure has been available for this protein. The inhibitor cystine knot ("ICK") family of 
proteins are also disulfide-rich and the structures of these invertebrate toxins have been used 
to suggest possible structures for the agouti and AGRP C-termmal regions (Kiefer, ct al. 
(1988) Biochemistry 37:991-997; Tota, el al.. supra). Indeed, the recently reported disulfide 
20 map for AGRP and a construct containing the C-terminal domain demonstrate ICK-like 
pairings (Norton, et al. (1998) Toxicon 36:1573-1583) of the 10 Cys residues: 1-16, 8-22, 
15-33, 19-43, 24-31 (using MARP numbering, Bures, et al., supra). Beyond such 
modeling, it is believed that the only structural data published on either agouti or AGRP are 
circular dichroism (CD) spectra which suggest that both proteins have little regular 
25 secondary structure, although there may be some P-sheet structure, consistent with ICK 
structural characlerislics (Willard, et al., supra; Rosenfeld. et al. (1998) Biochemistry 
37:16041-16052). 

U.S. Patents of interest in this area include: 5.994.087; 5.932,779; 5,869.257; 
5,843,652; 5.817.787; 5,766,877; 5,731.408; 5.703.220; and 5.622,860. Also of interest are: 
30 WO 99/64002; WO 99/57 148; WO 99/55679; WO 99/55832; WO 99/54358; WO 

99/50295; WO 99/43709; WO 99/31508; WO 99/21571; WO 98/56914; WO 98/10068; 
and WO 97/43412. 
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SUMMARY OF THE INVENTION 

This invcnlion provides a detailed deseription of the three-dimensional (3D) 
structure {NMR structure) in solution of the human AGRP Cys-rich C-lcrminal region. In 
addition, further minimization of human AGRP is achieved and a minimal sequence 

5 (designated herein as MARP-33) is identified that provides both activity and receptor 
specificity against MC3r and MC4r. Information about this region permits design of 
compounds that bind to the ligand binding site of melanocortm receptors and modulate 
ligand binding to the receptor. The compounds include agonists and antagonists that 
modulate melanocortin receptor activity by promoting (agonists) or blockmg (antagonists) 

10 ligand binding to the receptor and/or by activating the receptor themselves. In particular, 

using iniormation pioviucQ ncrciii, a cia:!>.:) vji p^^piivaOiiiiiiiwLi^b v^.^. ^iwi. ^^^^^.^^^ ,.^^x..^u.., — 
provided that are ligands for the melanocortm receptor and that can block and/or modulate 
activity of the melanocortin receptors. 

In one preferred embodiment, the peptides this emmbodiments provides a 

1 5 melanocortin receptor ligand that is a polypeptide comprising a peptide sequence having the 
formula: CX^X^X^X^X^X^^CXlx'X^X^^X^ ^X^^CCDPX'^ATCYCX'^X^^X^'^NAFCYCRn 
(Formula I), where X\ X^ X^ X^ X^ X\ X\ X\ X^ X^^ X'\ X'\ X'\ X'^ X'\ and X'' 
are independently selected amino acids, and n is zero or one. The amino acids include 
native ammo acids, both D- and L-fomn ammo acids, and modified or derivatized ammo 

20 acids or amino acid analogues. In certain particularly preferred embodiments, X , X^ X , 
X'^, X^ X'\ X\ X^ X'\ X'^, X^', X^", X'\ X^*^, X^^ and X^^' are independently selected from 
the group consisting of alanine, asparagine, arginine,aspartic acid, cysteine, glutamine, 
glutamic acid, glycine, histidine, isoleucme, leucine, lysine, methionine, phenylalanine, 
proline, serine, threonine, tryptophan, tyrosine, and valine. In certain embodimetns, the 

25 polypeptide consists of a polypeptide having the formula shown above (Formula I). 

In certain preferred embodiments, the peptide is not AGRP and said 
polypeptide is not MARP. In certain prefeired embodiments, the polypeptide excludes one 
or more of the final 13 residues of MARP (residues 34-46 of MARP). In certain 
embodiments, the polypeptide is not CVRLHESCLGQQVPCCDPAA TCYCRFFNAFCYC 

30 (SEQ ID NO:3) or does not comprise this sequence. 

In certain embodiments. xVx'x'X^X' X^' is VRLHES, or conservative 
substitutions thereof, anchor X'x'x''x'*'x'^X'' is LGQQVP. or conservative substitutions 
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thercor, and/or X'^X"x'^' is RFF or conservalivc subslilulions thereof. In cerlain 
embodiments, ' is not a cysteine and in particularly preferred embodiments, X' ' is A. 



In addition to the peptide melanoc(Mlin ligands described above, this 
invention provides a non-pcptidc melanocorlin receptor ligand of the structural formula of 
3 Formula I shown herein m which B, Ui, LN, Ri and R2 are independently selected from 
the group consisting of: hydrogen, alkyi, derivati/cd alkyl, cycloalkyi, derivatized 
cycloalkyi, halocycloalkyi, aloxycycloalkyl, heterocycloalkyl, aryl, arylalkyl, heteroaryl, 
and heteroarylalkyl; J is selected from the group consisting of carbon, nitrogen, silicon, and 
sulfur; X is selected from the group consisting of hydrogen, carbon, nitrogen, oxygen, 

10 silicon, and sulfur; and Z is selected from the group consisting of a continuing peptide bond, 
a hydroxyl; -NH2-, -NH-(n), and -N-(n,n'), and -0-(y), where where n and n* are 
independently selected from the group consisting of alkyl, alkenyl, alkynyl, cycloalkyi, aryl, 
arylalkyl, heteroaryl, heteroarylalkyl, or a derivatized form thereof, and y is selected from 
the group consisting of alkyl, alkenyl, alkynyl, cycloalkyi, aryl, arylalkyl, heteroaryl 

15 heteroarylalkyl, or a derivatized form thereof. In preferred embodiments, the ligand is a 
ligand for a MC3r and/or an MC4r melanocortin receptor. Particularly preferred non- 
peptide ligands have a molecular weight ranging from about 200 to 1000 daltons. In 
prefen*ed embodiments, the ligand has a structure that mimics the backbone of the AGRP 
active loop. Preferred ligands comprise a terminal gaunidino moiety and/or at least one 

20 methylbenzyl moiety. Particularly preferred ligands include the ligands of Formulas III, IV, 
V, and VI, shown herein. 

In another embodiment, this invention also provides libraries for screening 
for modulators of a melanocortin receptor. Preferred libraries comprise a plurality of 
polypeptide and/or non-polypeptide members that are ligands (e.g. as described above) that 

25 bind to a melanocortin receptor (e.g, MC3r, MC4r, etc.). Preferred libraries comprise at 

least 10, more preferably al least 100, and most preferably at least 1000 different members. 
In certain embodiments, the libraries are provided in multi-well plates. 

Also provided are pharmaceutical compositions comprising one or more of 
the melanocoitin-binding peptide and/or non-peptide ligands of this invention (e.g. as 

30 described above). The pharmaceutical compositions optionally further comprise a 
pharmaceutically acceptable excipient. 

In certain embodiments, this invention provides methods of modulating the 
activity of a melanocortin receptor (e.g. MC3r. MC4r, etc.) andL^or methods of modulating a 
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mclanocorlin receptor mediated physiological process. The methods involve contacting the 
receptor with a peptide ligand and/or with a non-peptide ligand of this invention (c^- '^^ 
described above). In preferred cmbcxliments, the contacting blockage of the receptor site 
and/or upregulation or downregulation of melanocortin receptor activity as measured 
5 by cAMP assays). In certain embodiments the ligand is a melanocortin agonist. In certain 
embodiments, the hgand is a melanocortin receptor antagonist. 

This invention also provides methods of prescreening for a modulator of a 
melanocortin receptor (e.,^. MC3r, MC4r, etc.). The methods involve contacting a 
melanocortin receptor with one or more of the melanocortin receptor peptide ligands and/or 

10 non-peptide ligands of this invention (e.g. the ligands described above), and detecting 

binding of the peptide to the melanocortin receptor wherein specific binding of the peptide 
to the melanocortin receptor indicates that said peptide is a potential modulator of the 
melanocortin receptor. 

In still another embodiment, this invention provides methods of screening for 

15 a modulator of melanocortin receptor activity. The methods involve contacting a 

melanocortin receptor (e.g, MC3r, MC4r, etc.) with a peptide ligand or a non-peptide ligand 
of this invention and detecting activity of the melanocortin receptor wherein a difference in 
activity of the receptor, as compared to a control, indicates that the ligand is a modulator of 
melanocortin receptor activity. In preferred embodiments, the control is a negative control 

20 comprising the same assay without the ligand. 

This invention also provides a method of identifying a compound that 
modulates ligand binding to a melanocortin receptor, where the method involves modeling 
test compounds that fit spatially into a melanocortin receptor ligand binding site of interest 
using an atomic structural model of a melanocortin receptor binding region or portion 

25 thereof; screening the test compounds in an assay characterized by binding of a test 

compound to a melanocortin receptor ligand binding site; and identifying a test compound 
that modulates ligand binding to said melanocortin receptor. In preferred embodiments, the 
melanocortin receptor binding region comprises the minimized agouti related protein 
receptor binding region (mini-AGRF) or a portion thereof. In preferred embodiments, the 

30 atomic structural model comprises atomic coordinates of amino acid residues corresponding 
to residues 1-18 of the N-terminal loop of the minimized agouti related protein (residues 1- 
18 of SEQ ID NO:2), residues 19-13 of the central loop of the minimized agouti related 
protein (residues 19-34 of SEQ ID NO:2), and residues 35-46 of the C-terminal loop of the 
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minimi/cd agouti related protein (residues 35-46 of SEQ ID NO:2). In certain preterrcd 
embodiments, the atomic structural model comprises atomic coordinates of amino acid 
residues corresponding to residues 19-34 of the central loop of the minimi/cd agouti related 
protem (residues 19-34 of SEQ ID NO:2) and at least residues I 5- 1 8 of the N-terminal loop 
5 of the minimized agouti related protein (residues 15-18 of SEQ ID NO:2). In cenain 

prefeiTed embodiments, the atomic structural model comprises atomic coordinates of amino 
acid residues corresponding to residues 19-34 of the central loop of the minimized agouti 
related protein (residues 19-34 of SEQ ID NO:2) and at least 20% of the contiguous or non- 
contiguous residues or their atoms are selected from residues 1-18 of the N-terminal loop oi 
10 the minimized agouti related protein (residues 1-18 of SEQ ID NO:2). The atomic 

structural m.odel can comprises atomic coordinates of amino acid residues con-esponding to 
residues 24-31 of the active loop of the minimized agouti related protein (residues 24-31 of 
SEQ ID NO:2). In certain embodiments, the atomic structural model comprises atomic 
coordinates of ammo acid residues corresponding to residues 25-27 of the active loop of the 
15 minimized agouti related protein (residues 25-27 of SEQ ID NO:2). The screening is 
preferably in vitro. In certain embodiments, the screening is high throughput screening 
(HTS ). Preferred assays include, but are not limited to biological assays. The test 
compound can be provided from a library of compounds and preferred test compounds are 
agonists or antagonists of ligand binding. Particulariy preferred test compounds include a 
20 small organic molecule, a peptide, or peptidomimetic. 

In another embodiment this invention provides methods for identifying an 
agonist or antagonist of ligand binding to a melanocortin receptor, said method comprising 
the steps of: providing the atomic coordinates of a melanocortin receptor binding region or 
portion thereof to a computerized modeling system; modeling compounds which match or 
25 mimic the receptor binding region and thus fit spatially into the melanocortin receptor 
ligand binding site; and identifying in an assay for melanocortin receptor activity a 
compound that increases or decreases the activity of said melanocortin receptor by binding 
the ligand binding site of said melanocortin receptor, whereby an agonist or antagonist of 
ligand binding is identified. In prefeiTed embodiments, the melanocortin receptor binding 
30 region comprises the minimized agouti related protein receptor binding region or portion 
thereof\ 

Also provided is a machine-readable data storage medium, comprising a data 
storage material encoded with machine readable data which, when using a machine 
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programmed w ith instructions tor using said data, is capable of displa\ ing a graphical three- 
dimensional representation of a molecule that binds a melanocortm receptor comprising 
structure coordinates of amino acid residues corresponding to residues 1-18 of the N- 
terminal loop of the minimized agouti related protein (residues 1-18 ofSEQ ID NO:2), 
5 residues 19-13 ot the central loop of the minimized agouti related protein (residues 19-34 of 
SEQ ID NO:2). and residues 35-46 of the C-terminal loop of the minimized agouti related 
protein (residues 35-46 of SEQ ID NO:2), or a homologue of said molecule. In certain 
embodiments, the molecule is a mclanocortin receptor agonist or antagonist. In particularly 
preferred embodiments, the molecule is defmed by the set of structure coordinates depicted 

10 in Table 4 or Table 5, or a homologue of the molecule, the homologue having a root mean 
square deviation from the backbone atoms of said amino acids of not more than 10, 
preferably no more than 5 and most preferably no more than 2.54A. 

A machine-readable data storage medium is also provided that comprises a 
data storage material encoded with a first set of machine readable data that, when combined 

15 with a second set of machine readable data, using a machine programmed with instructions 
for using said first set of data and said second set of data, can determine at least a portion of 
the structure coordinates coiresponding to the second set of machine readable data, wherein 
the first set of data comprises a Fourier transform of at least a portion of the structural 
coordinates selected from the group consisting of coordinates depicted in Table 4 or Table 

20 5; and the second set of data comprises an X-ray diffraction pattern of a molecule. 

In still another embodiment this invention provides an NMR structure of a 
minimized agouti related protein (mini-AGRP), embodied in a computer readable media. 

This invention also provides a polypeptide comprising the amino acid 
sequence: CVRLHESCLGQQVPCCDPAATCYCRFFNAFCYC (SEQ ID NO:3) or a 

25 modified form thereof, where the polypeptide is not a full-length AGRP and the polypeptide 
is not a MARP. Prcfcned such polypeptides are chemically synthesized. 

Also provided is a method of treating a disease state in mammals that is 
alleviated by treatment with a polypeptide having an amino acid sequence: 
CVRLHESCLGQQVP CCDPAATCYCRFFN AFCYC (SEQ ID NO:3) where the method 

30 comprises administering to a mammal in need of such a treatment a therapeutically effective 
amount of said pohpeptide, or a pharmaceutically acceptable salt thereof. In certain 
embodiments, the disease state is a v\ asting syndrome. This invention also includes a 
pharmaceutical composition comprising a therapeutica!l\ effective amount of a pohpeptide 
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ol lhc sequence: C VRLHFiSCLGQQVPC CDFAA TCYCRFFN Af'C YC (SEQ ID NO;3) or 
a pharmaceulicall y acceptable salt thereof. 

DKI IMTIONS 

T[ie terms "residue" or "amino acid" as used herein refers to natural, 
5 synthetic, or modified amino acids (amino acid analogues). Such amino acids include both 
"D" and "L" forms. Various amino acid analogues include, but are not limited to, 2- 
aminoadipic acid, 3-aminoadipic acid, beta-alanine, beta-aminopropionic acid, 2- 
aminobutyric acid, 4-aminobutyric acid, piperidinic acid, 6-aminocaproic acid, 2- 
aminoheptanoic acid, 2-aminoisobutyric acid, 3-aminoisobutyric acid, 2-aminopimelic acid 

10 2,4- diaminobutyric acid, desmosine, 2,2'-diaminopimelic acid, 2,3-diaminopropionic acid, 
N-ethylglycine, N-cthylasparagine, hydroxylysine, allo-hydroxylysine, 3-hydroxyproline, 4 
hydroxyproline, isodesmosine, allo-isoleucine, N-methylglycme, sarcosine, N- 
methylisoleucine, 6-N-methyllysine, norvaline, norleucine, ornithine, etc. 

The terms "polypeptide", "peptide" and "protein" are used interchangeably 

15 herein to refer to a polymer of amino acid residues. The terms apply to amino acid 

polymers in which one or more amino acid residue is an artificial chemical analogue of a 
corresponding naturally occurring amino acid, as well as to naturally occurring amino acid 
polymers. The term also includes variants on the traditional peptide linkage joining the 
amino acids making up the polypeptide. 

20 The term "peptidomimetic" as used herein refers to a molecule that 

structurally and chemically resembles a peptide of two or more amino acids with respect to 
the features critical for a particular desired activity (e.g. receptor specificity, activity, etc.). 
The term "peptidomimetic" is includes peptide analogs that serve as appropriate substitutes 
for peptides in interactions with e.g., receptors and enzymes. Peptidomimetics can be 

25 "peptides" incorporating modified residues and/or backbones, e.g. that may have improved 
pharmacokinetic properties as a result of proteolytic stability, or unique structural and/or 
hydrogen bonding motifs, etc. Peptidomimetics also include organic molecules that are 
capapble of mimicing one more properties (e.g. binding specificity, affinity, etc.) of a 
peptide. Such organic molecule typically comprise a "scaffold" that mimics part or all of 

30 the 3-dimensional structure of the subject peptide and places appropriate functional groups 
in a spatial orientation sufficient to achieve the desired functional properties. Particularly 
prefeired peptidomimetics include small organic molecules. 
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The term "small organic molecule" refers to a molecule of a si/c comparable 
to those organic molecules generally used m pharmaceuticals. The term excludes biological 
macromolcculcs {c.,^.. proteins, nucleic acids, etc.). Preferred small organic molecules range 
in si/e up to about 5000 Da, more prclerably up to 2000 Da, and most preferably up to 

5 about 1000 Da. 

The term database refers to a means for recording and retrieving information. 
In prefeired embodiments the database also provides means for sorting and/or searching the 
stored information. The database can comprise any convenient media including, but not 
limited to, paper systems, card systems, mechanical systems, electronic systems, optical 

10 systems, magnetic systems or combinations thereof. Preferred databases include electronic 
(e.g. computer-based) databases= Computer systems for use in storage and manipulation of 
databases are well known to those of skill m the art and include, but are not limited to 
"personal computer systems", mainframe systems, distributed nodes on an inter- or intra- 
net, data or databases stored in specialized hardware (e.g. in microchips), and the like. 

15 The term "conservative substitution" is used in reference to proteins, 

peptides, etc. to reflect amino acid substitutions that do not substantially alter the activity 
(specificity or bmdmg affinity) of the molecule. Typically, conservative amino acid 
substitutions involve substitution of one amino acid for another amino acid with similar 
chemical properties (e.g. charge or hydrophobicity). The following six groups each contain 

20 amino acids that are typical conservative substitutions for one another: 1) Alanine (A), 
Serine (S), Threonine (T); 2) Aspartic acid (D), Glutamic acid (E); 3) Asparagme (N), 
Glutamme (Q); 4) Arginine (R), Lysme (K): 5) Isoleucme (I), Leucme (L), Methionme (M), 
Valine (V); and 6) Phenylalanine (F), Tyrosine (Y), Tryptophan (W). 

The terms "binding partner", or "capture agent", or a member of a "binding 

25 pair" refers to molecules that specifically bind other molecules to form a bindmg complex 
such as antibody-antigen, lectin-carbohydrate, nucleic acid-nucleic acid, biotin-avidin, etc. 

The term "specifically binds", as used herein, when refeiring to a 
biomolecule (e.g., protein, nucleic acid, antibody, receptor, etc.), refers to a binding reaction 
which is determinative of the presence biomolecule in heterogeneous population of 

30 molecules (e.g,, proteins and other biologies). Thus, under designated conditions (e.g. 

immunoassay conditions in the case of an antibody or stringent hybridization conditions in 
the case of a nucleic acid), the specified tigand or antibody binds to its particular "target" 
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molecule and docs not bind in a significant amount to other molecules present in the 
sample. 

"Alkyl" means a linear saturated monovalent hydrocarbon radical of one to 
six carbon atoms or a branched saturated monovalent hydrocarbon radical of three to six 
5 carbon atoms, c\h-. methyl, ethyK propyl, 2-propyl, butyl, pentyl, and the like, 

"Alkenyl" means a linear monovalent hydrocarbon radical of two to six 
carbon atoms or a branched monovalent hydrocarbon radical of three to six carbon atoms 
containing at least one double bond, e.g., ethenyl, 2-propenyl, and the like. 

"Alkynyl" means a linear monovalent hydrocarbon radical of two to six 
10 carbon atoms or a branched monovalent hydrocarbon radical of three to six carbon atoms 
containing at least one triple bond, e.^.. ethynyK propynyl, butynyl, and the like. 

"Cycloalky!" means a cyclic saturated monovalent hydrocarbon radical of 
three to seven carbon atoms, c\g,, cyclopropyl, cyclohexyl, and the like. 

"Halo" means fluoro, chloro, bromo, and iodo. 
15 "Haloalkyl" means alkyl substituted with one or more halogen atoms, 

including those substituted with different halogens, e.g.rCW2C\, --CF3, --CH2CF3, -- 
CF2CF3, --CH2CCI3, and the like. 

"Alkoxy", "alkenyloxy", "cycloalkyloxy", or "haloalkyloxy" means a radical 
-OR where R is alkyl, alkenyl, cycloalkyi, or haloalkyl respectively as defined above, e.g., 
20 methoxy, ethoxy, propoxy, 2-propoxy, ethenyloxy, cyclopropyloxy, cyclobutyloxy, --OCHi 
CI, "OCF3, and the like, 

"Alkylthio" or "cycloalkylthio" means a radical --SR where R is alkyl or 
cycloalkyi respectively as defined above, methylthio, butylthio, cyclopropylthio, and 
the like. 

25 "Acyl" means a radical -C(0)R where R is hydrogen, alkyl, or haloalkyl as 

defined above, e.g.. formyl, acetyl, tnfluoroacetyl, butanoyl, and the like. 
"Amino" means a radical — NH2 

"Monosubstituted ammo" means a radical --NHR where R is alkyl or acyl, 
e.g.. methylamino, ( l-methylethyl)amino, and the like. 
30 "Disubstituted amino" means a radical --NRR' where R and R' are 

independently alkyl or acyl, t\,£,^,dimethylamino. methylethylamino, did- 
miCthylethyOamino, and the like. 
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"H ydrox yalk\ 1" means a linear monovalenl hydrocarbon radical of two to six 
carbon atoms or a branched monovalent hydrocarbon radical of three to six carbons 
substituted with one or two hydroxy groups, provided that il" two hydroxy groups are 
present they are not both on the same carbon atom. Representative examples include, but 
5 arc not limited to, 2-hydroxycthyK 2-hydroxypropyl, 3-hydroxypropyl, l-(hydroxymcthylj- 
2-methylpropyl, 2-hydroxybutyK 3-hydroxybutyl, 4hydroxybuty!, 2,3-dihydroxypropyK 1- 
(hydroxymethyl)-2-hydroxyethyl, 2,3-dihydroxybutyl, 3,4dihydroxybutyl and 2- 
(hydroxymethyl)-3-hydroxypropyK 2-hydroxyethyK 2,3-dihydroxypropyl, and 1- 
(hydroxymethyl)-2-hydroxyethyl. 
10 "Alkoxyalkyl" means a linear monovalent hydrocarbon radical of one to six 

carbon atoms or a branched monovalent hydrocarbon radical of three to six carbons 
substituted with at least one alkoxy group as defined above, e.g., 2-methoxyethyl, 2- 
methoxypropyl, and the like. 

"Hydroxyalkyloxy" or "alkoxyalkyloxy" means a radical-OR where R is 
15 hydroxyalkyl or alkoxyalkyl respectively as defined above, e.g., 2-hydroxyethyloxy, 2- 
methoxyethyloxy, and the like. 

"Aminoalkyl" means a linear monovalent hydrocarbon radical of two to six 
carbon atoms or a branched monovalent hydrocarbon radical of three to six carbons 
substituted with at least one — NRR' where R and R' are independently selected from 
20 hydrogen, alkyl, or acyl, e.g., 2-aminoethyl, 2-N,N-diethylaminopropyl, 2-N- 
acetylaminoethyl, and the like. 

"Aryl" means a monovalent monocychc or bicyclic aromatic hydrocarbon 
radical of 6 to 12 ring atoms, and optionally substituted independently with one or more 
substituents selected from alkyl, haloalkyk cycloalkyl, alkoxy, alkylthio, halo, nitro, acyl, 
25 cyano. amino, monosubstituted amino, disubstituted amino, -hydroxy, carboxy, or 

alkoxycarbonyl. Representative examples include, but are not limited to, phenyl, biphenyl. 
1-naphthyl, and 2-naphthyl and the derivatives thereof. 

"Heteroaryl" means a monovalent monocyclic or bicyclic aromatic radical of 
5 to 10 ring atoms containing one or more, sometimes one or two ring heteroatoms selected 
30 from N, O, or S, the remaining ring atoms being C. The heteroaryl ring is optionally 

substituted independently with one or more substituents, sometimes one or two substituents, 
selected from alkyl, haloalky], cycloalkyl, alkoxy, alkylthio, halo, nitro, acyl, cyano, amino, 
monosubstituted amino, disubstituted amino, hydroxy, carboxy, or alkoxycarbonyl. 
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Specifically the term heteroaryl includes, but is not limited to, pyndyl, pyrrolyl, thienyl, 
turanyl. indolyl, quinolyl, bcn/opyranyl, and thia/olyl, and the dernatives thereof. 

"Hetcrocycloamino" means a saturated monovalent cyclic group of 3 to 8 
ring atoms, wherein at least one ring atom is N and optionally contains a second ring 
hetei'oatom selected from the group consisting of N, O, or S(0) „ (where n is an integer from 
0 to 2), the remaining ring atoms being C. The hetcrocycloamino ring may be optionally 
fused to a benzene ring or it may be optionally substituted independently with one or more 
substituents, sometimes one or two substituents, selected from alkyi, haloalkyi, cycloalkyi, 
cycloalkylalkyi, aryl, aralkyi, heteroaryl, heteroaralkyl, halo, cyano, acyl, amino, 
monosubstituted amino, disubstituted amino, carboxy, or alkoxycarbonyl. More specifically 
the term heterocycloamino includes, but is not limited to, pyrrolidine, piperidino, 
morpholino, piperazino, mdolino, and thiomorpholmo, and the derivatives thereof. 

"Heterocycio" means a saturated monovalent cyclic group of 3 to 8 nng 
atoms in which one or two ring atoms are heteroatoms selected from N, O, or 5(0)^, where 
n is an integer from 0 to 2, the remaining ring atoms being C. The heterocyclo ring may be 
optionally fused to a benzene ring or it may be optionally substituted independently with 
one or more substituents, sometimes one or two substituents, selected from alkyl, haloalkyi, 
cycloalkyi, cycloalkylalkyi, aryl, aralkyi, heteroaralkyl, halo, cyano, acyl, monosubstituted 
amino, disubstituted amino, carboxy, or alkoxycarbonyl. More specifically the term 
heterocyclo includes, but is not limited to, pyrrolidino, piperidino, moipholino, piperazino, 
tetrahydropyranyl, and thiomorpholino, and the derivatives thereof. 

"Cycloalkylalkyi" means a radical -R^'R^ where R"" is an alkylene group and 
R^ is a cycloalkyi group as defined above e.g., cyclopropylmethyl, cyclohexylpropyl, 3- 
cyclohexyl-2-methylpropyl, and the like. 

"Cycloalkylalkyloxy" means a radical -OR where R is a cycloalkylalkyi 
group as defined above e.f^.. cyclopropylmethyloxy, 3~cyclohexylpropyloxy, and the like. 

"Aralkyi" means a radical - R^ R ^ where R^ is an alkylene group and R ^ is 
an aryl group as defined above e.i^., benzyl, phenylethyl, 3-(3-chlorophenyl)-2- 
methylpentyl, and the like. 

"Heteroaralkyl" means a radical - R^ R'^ where R^ is an alkylene group and R'^ 
is a heteroaryl group as defined above e.i^., 2-3-, or 4-pyridylmethyl, furan-2-ylmethyl and 
the like. 
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"Hclcrocycloalkyl" means a radical - R^* R'^ where R^' is an alkylene group 
and R'^ is a hcterocyclo group as defmed above inorphohn-4-ylelhyl, tetrahydrofuran 2- 

yhnelhyl and the hkc. 

A "pro-drug" is a compound that releases an active drug (^^^^ a mini-AGRP, 
5 a peptidomimelic as described herein) when such prodrug is administered to a mammahan 
subject. In certain embodiments, prodrugs arc prepared by modifying functional groups 
present in the peptidomimetics of this invention or by modifyng mini-AGRP polypeptides 
in such a way that the modifications may be cleaved in vivo to release the active compound. 
Peptidomimetic prodrugs include the peptideomimetics descnbed herein wherein a hydroxy, 
10 amino, or sulfhydryl group in compound is bonded to any group that may be cleaved in vivo 
to regenerate the free hydroxyh amino, or sulfhydryl group, respectively. Examples of 
prodrugs include, but are not limited to esters ie.)^., acetate, formate, and benzoate 
derivatives), carbamates (e.g., N,N-dimethylaminocarbonyl) of hydroxy functional groups 
in compounds of formula (I), and the like. 
15 "Optional" or "optionally" means that the subsequently described event or 

circumstance may, but need not, occur, and that the description includes instances where the 
event or circumstance occurs and instances in which it does not. For example, "heterocyclo 
group optionally mono- or di- substituted with an alkyl group" means that the alkyl may, but 
need not, be present, and the description includes situations where the heterocyclo group is 
20 mono- or disubstituted with an alkyl group and situations where the heterocyclo group is not 
substituted with the alkyl group. 

Compounds that have the same molecular formula but differ in the nature or 
sequence of bonding of their atoms or the arrangement of their atoms m space are tenned 
"isomers." Isomers that differ in the arrangement of their atoms in space are termed 
25 "stereoisomers;' Stereoisomers that are not minm images of one another are termed 

"diastereomcrs" and those that are non-supenmposable mirror images of each other are 
termed "enantiomers." When a compound has an asymmetric center, for example, it is 
bonded to four different groups, a pair of enantiomers is possible. An enantiomer can be 
characterized by the absolute configuration of its asymmetric center and is described by the 
30 R- and S-sequencing rules of Cahn and Prelog, or by the manner in which the molecule 

rotates the plane of polarized light and designated as dextrorotatory or levorotalory (i.e., as 
(+) or (-)-isomers respectively). A chiral compound can exist as either individual 
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cnantiomcr or as a mixture thereof. A mixture contaming equal proportions of the 
enantiomers is called a "racemic mixture/' 

The compounds of this invention may possess one or more asymmetric 
centers: such compounds can therefore he produced as individual (R)- or (S)- stereoisomers 
5 or as mixtures thereof. Unless indicated otherwise, the description or naming of a particular 
compound in the specification and claims is intended to include both individual enantiomers 
and mixtures, racemic or otherwise, thereof. The methods for the determination of 
stereochemistry and the separation of stereoisomers are well-known in the art (see 
discussion in Chapter 4 of "Advanced Organic Chemistry", 4th edition J. March, John 

10 Wiley and Sons, New York, 1992). 

A "pharmaceutically acceptable excipient" means an excipient that is useful 
in preparing a pharmaceutical composition that is generally safe, non-toxic and neither 
biologically nor otherwise undesirable, and includes an excipient that is acceptable for 
veterinary use as well as human pharmaceutical use. "A pharmaceutically acceptable 

15 excipient" as used in the specification and claims includes both one and more than one such 
excipient. 

A "pharmaceutically acceptable salt" of a compound means a salt that is 
pharmaceutically acceptable and that possesses the desired pharmacological activity of the 
parent compound. Such salts include: (1) acid addition salts, formed with inorganic acids 

20 such as hydrochloric acid, hydrobromic acid, sulfuric acid, nitric acid, phosphoric acid, and 
the like; or formed with organic acids such as acetic acid, propionic acid, hexanoic acid, 
cyclopentanepropionic acid, glycolic acid, pyruvic acid, lactic acid, malonic acid, succinic 
acid, malic acid, maleic acid, fumaric acid, tartaric acid, citric acid, benzoic acid, 3-(4- 
hydroxybenzoyl)benzoic acid, cinnamic acid, mandelic acid, methanesulfonic acid, 

25 ethanesulfonic acid, i,2-ethanedisulfonic acid, 2-hydroxyethanesulfonic acid, 

benzenesulfonic acid, 4-chlorobenzcnesulfonic acid, 2-naphthalenesulfonic acid, 4- 
toluenesulfonic acid, camphorsulfonic acid, 4methylbicyclo>2.2.2!oct-2-ene-l-carboxylic 
acid, glucoheptonic acid, 4,4'-methylenebis-(3-hydroxy-2-ene- 1 -carboxylic acid), 3- 
phenylpropionic acid, trimethylacetic acid, tertiary butylacetic acid, lauryl sulfuric acid, 

30 gluconic acid, glutamic acid, hydroxynaphthoic acid, salicylic acid, stearic acid, muconic 

acid, and the like; or (2) salts formed when an acidic proton present in the parent compound 
either is replaced by a metal ion, e.g.. an alkali metal ion, an alkaline earth ion, or an 
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aluminum ion; or coordinates with an organic base such as cthanotaininc, diethanolaminc, 
triclhani^laminc, tromcthaminc, N-methylglucaminc, and the hkc. 

"Treating" or "treatment" of a disease includes: ( 1 ) preventing the disease, 
i.e. causing the chnical symptoms of the disease not to develop in a mammal that may be 
S exposed to or predisposed to the disease but does not yet experience or display symptoms ol 
the disease, (2) inhibiting the disease, i.e.. arresting or reducing the development of the 
disease or its clinical symptoms, or (3) relieving the disease, i.e., causing regression of the 
disease or its clinical symptoms. 

A "therapeutically effective amount" means the amount of a compound that, 
10 when administered to a mammal for treating a disease, is sufficient to effect such treatment 
for the disease. The "therapeutically effective amount" will vary depending on the 
compound, the disease and its severity and the age, weight, etc., of the mainmal to be 
treated. 

The term database refers to a means for recording and retrieving information. 

15 In preferred embodiments the database also provides means for sorting and/or searching the 
stored information. The database can comprise any convenient media including, but not 
limited to, paper systems, card systems, mechanical systems, electronic systems, optical 
systems, magnetic systems or combinations thereof. Preferred databases include electronic 
(t'.g. computer-based) databases. Computer systems for use in storage and manipulation of 

20 databases are well known to those of skill in the art and include, but are not limited to 

"personal computer systems", mainframe systems, distributed nodes on an inter- or intra- 
net, data or databases stored in specialized hardware {e.g. m microchips), and the like 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figures 1 A and IB show the Cc^ backbone of the MARP minizxd average 
25 structure determined at 500 mllz and 800 mHz, respectively. Figure 1 A: The N-terminal 
loop, central loop and C-terminal loop are indicated. Disulfide bonds are represented by 
dashed lines. Spheres represent residues with amides protected from HX for more than 12 
hours, more than 24 hours and more than 8 days. Figure IB shows a C(alpha) backbone 
representation of the lowest energy AGRP(87-132) structure of the family of 40 determined 
30 from 800 MHz NOESY data and deposited in the PDB (code: IHYK). The N-terminal loop 
is show in light grey at the lower left,the central loop is dark grey (top middle), and the C- 
tcrminal loop is shown at the lower right. The disulfide bonds are included as thin dashed 
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white lines. The spheres highlight residues the amides of which were protected from HX: 
medium gray > 12 hours, light gray > 24 hours and dark gray > 8 days. 

Figure 2A depicts the heavy atom (non-hydrogen) backbone representation 
of MARP tor 14 NMR structures with residues 1-34 lit to the minimized average structure 

5 (Root Mean Square Deviation, "RM.SD" 1 .49A). Only the minimized average structure 
(thick cylinder) is shown for the more disordered C-terminal loop. Figure 2B shows an all 
atom backbone representation of AGRP( 87-132) for the 40 structures deposited in the PDB 
calculated from 800 MHz NOESY data. Only the backbone atoms of 1-34 have been 
aligned (RMSD = 0.34 angstroms). The disulfide-bound residues are indicated with 

10 straight lines interconnecting regions of the backbone, and are included for all members of 
the family. 

Figure 3 shows the backbone atoms for residues 24-31 of the family of 20 
structures with residues 24-3 1 fit to the minimized average structure. The side chain heavy 
atoms of residues 25, 26 and 27, essential for activity, are shown. 
15 Figure 4 illustrates a reaction scheme for the synthesis of peptidomimetics of 

this invention. 

DETAILED DESCRIPTION 

The agouti related protein ("AGRP") is a mammalian signaling molecule, 
involved in weight homeostasis, that causes adult onset obesity when overexpressed in 

20 mice. AGRP was originally identified by homology to the agouti protein, another potent 
signaling molecule involved in obesity disorders in mice. While AGRFs exact mechamsm 
of action is unknown, it has been identified as a competitive antagonist of melanocortin 
receptors 3 and 4 ("MC3r" and "MC4r"). MC4r in particular is implicated in the 
hypothalamic control of feeding behavior. Full length agouti and AGRP are only 25% 

25 homologous, however, their active C-terminal regions are -40% homologous, with nine out 
of the 10 Cys residues spatially conserved. Until now, 3D structures have not been 
available for either agouti, AGRP or their C-terminal regions. 

In one embodiment, this invention provides a detailed three-dimensional 
(3D) NMR structure in solution of the human AGRP Cys-nch C-terminal region as 

30 determined by 'H NMR using a protein prepared by total chemical synthesis. As used 

herein the term "the NMR structure" is understood to refer to the minimized average of the 
family of NMR structures. Because biochemical investigations demonstrate that this 
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minimal region rclams lull liiological activily. tliis protein is rcterrcd lo lierein as mmimi/cd 
agouti related protein ("MARP"). MARP residues !-46 (SEQ ID NO:2). correspond to 
human AGRP residues 87-132 (residues 87-132 of SEQ ID NO:!). Thus, human AGRP 
numbering is obtained by adding 86 lo MARP numbering. 
5 MARP's topology is characterized by three large loops (referred to herein as 

the N-terminal loop, the central loop and the C-tcrminal loop), with four of the live disulfide 
bridges at the base of the structure, and an absence of canonical secondary structure such as 
helices or sheets. Two of the three loops are structurally well characterized by the NMR 
data as mdicated by low RMSDs. The region of MARP containing the RFF triplet (Tota, et 
10 al., supra) (residues 25-27 in MARP) necessary for function is located in one of the best 
defined resions of the protein. While previously reported structural models of the C- 
terminal region of AGRP were attempted based on Cys homology between AGRP and 
certain toxin proteins, Cys spacing was not sufficient to correctly determine the 3D fold of 
the molecule. It was initially thought, based on 500 mHz data that MARP did not adopt an 
1 5 ICK-like fold, however, new data obtained at 800 mHZ (Table 5 herein) shows that MARP 
does adopt the ICK-like fold. 

The 3D structure of MARP provided herein presents a basis for the 
development of methods and compositions for identifying compounds that modulate 
melanocortin receptor activity, in particular the activity of MC3r and MC4r. The 3D 
20 structure presents precise structural information that permits the rational design of 
compounds that preferentially modulate MC4r or MC3r activity. 

Based on an analysis of this structure, a minimal AGRP domain is identified 
(designated as a mim-AGRP) that shows both melanocortin activity and melanocortin 
specificity. Residues are identified that can be systematically or randomly altered to 
25 produce a large number of mini-AGRPs having differing receptor specificity and/or binding 
affinity. 

Collections of such mini-AGRPs provide convenient libraries that can be 
screened to identify mini-AGRPs having particular activity/specificity profiles. THese 
AGRPs can be used to modulate melanocortin receptor activity {in vivo or /// vitro) or they 
30 can be used as "lead compounds" for the design of peptido mimetics. 

In addition a class of peptidomimctics is identified herein that are specific 
ligands to melanocortin receptors (e.ii. MC3r and/or MC4r). In certain emliodimcnts. the 
peptidomimctics are used to modulate melanocortin receptor activity in vivo or in vitro. In 
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other embodiments, they arc provided as libraries that can be screened for peptidomimetics 
having particular desired specificity/activity profiles. 

I. Structure of MARP 

The 3D structure of MARP is characterized by three loops held together at 

5 the base by an apparent scaffold of four disulfide bonds M6, 8-22, 15-33 and 19-43. The 
fifth disulfide bond, 24-31, further stabilizes the base of the active loop which presents the 
RFF triplet on the protein surface. There is no identifiable canonical helical structure. The 
800 mHz data reveal a small beta sheet structure. The RFF triplet is critical for the activity 
of MARP as a competitive antagonist of a-MSH stimulated activation of MC4r signaling. 

10 The structure described herein shows that MARP is structured to present the side chains of 
the RFF triplet on the surface of the protein and to the surrounding solvent. Recent work 
demonstrates that MARP is much more active than smaller AGRP derived peptides 
containing the RFF triplet (Tota, et al. (1999) Biochemistry 38:897-904). Thus, the detailed 
fold of the central loop and perhaps the presence of the N- and C-terminal loops are 

15 important for AGRP function. In addition, based upon work with chimeras of melanocortin 
receptors, we believe the N- and C-terminal loops confer receptor subtype specificity. 

The previous absence of structural data on both AGRP and agouti 
encouraged the modeling of the C-terminal regions of these proteins onto the ICK family 
(Norton, et al., supra)) which is charactenzed by homologous Cys spacing (Kiefer, et al., 

20 Biochemistry 37:991-997 (1998); Tota, et al., supra). The ICK family of proteins primarily 
consists of small «60 residues) disulfide-rich (three or four disulfides) toxin proteins from 
the venom of spiders and cone snails, which function as ion channel antagonists (Norton, et 
al., supra). The coincidence between the function of the majority of these toxins and the 
recent description of part of the agouti protein's mechanism of action being calcium 

25 dependent (Kim, et al., FASEB J. 10; 1646-1652 ( 1996); Kim. et al., A;h. J. Physiol 
272:E379-384 ( 1997); Jones, et al., Am, J. Physiol. 270:E192- 196 (1996)) further 
encouraged these homology modeling efforts. The ICK motif in particular is characterized 
by the topology of the three disulfide bonds coiTcsponding to 1-16, 8-22 and 15-33 in 
MARP. In the ICK motif the first two disulfide bonds with their intervening main chain 

30 atoms form a topological circle through which the third disulfide bond passes, forming the 
cxstine knot (Norton, et af, supra). The motif is further characterized by the identification 
of an irregular tnple stranded antiparallel P-sheet, roughly coiTcsponding to residues 6-8. 
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20-24 and 31-34 in MARP. The remaining two disulfide bonds in MARP each occur in 
individual ICK proteins as separate examples of potential "non-motif" disulfide bonds, 
although no examples of ICK motif proteins with five disulfide bonds have been observed. 
Despite these apparent similarities, the experimental structure of MARP 
5 determined at 500 mH/, suggested that this protein did not satisfy the criteria required for 
inclusion in the ICK family. While the first two disulfide bonds in MARP 1-16 and 8-22, 
together w ith the polypeptide backbone form a topological circle, none of the remaining 
disulfides passes through the circle to form a cystine knot. Instead, disulfide bond 15-33 is 
positioned adjacent to the circle with all of the fold on one side of this circle. In addition, 

10 MARP lacks the P-sheet found in ICK family proteins. The experimental determination of 
the distinctive 3D structure of MARP described herein suggests that Cys spacing and even 
the disulfide map of small Cys-rich proteins may not always be sufficient to accurately 
predict protein folds. These results speak to the potential limitations of "homology 
modeling" of protein structures, and may have important implications for the emerging field 

15 of genomic structural biology. However, based on the 800 mHz NMR data, we have not 
assigned MARP to the ICK class. 

As described in the Examples, ligand binding studies, and analysis of atomic 
models derived from the MARP NMR structure reveal for the first time a previously 
unknown structure for MARP and its receptor binding region. By "receptor binding region" 

20 is intended a structural segment or segments of melanocortin receptor ligands, and MARP 
in particular, folded in such a way so as to give the proper geometry and ammo acid residue 
conformation for binding to a melanocortin receptor. By "ligand binding site" is intended a 
structural segment or segments of melanocortin receptor polypeptide chain folded in such a 
way so as to give the proper geometry and amino acid residue conformation for binding a 

25 ligand. These are the physical arrangement of protein atoms in three-dimensional space 
forming a receptor binding region or a ligand binding site pocket or cavity. 

The MARP structure has three major loops: the N-terminal loop, residues 1- 
18 (residues 1-1 8 of SEQ ID NO:2), the central loop, residues 19-34 (residues 19-34 of SEQ 
ID NO:2) and the C-terminal loop, residues 35-46 (residues 35-46 of SEQ ID NO:2). 

30 Residues forming the receptor binding region are amino acids corresponding to (i.e., the 
same as or equivalent to) residues 24-3 1 of the central loop (residues 24-3 1 of SEQ ID 
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NO:2), rciciTcd lo herein as the "active" loop. In particular, residues 25, 26 and 27 (the 
"RFF" triplet) of the active loop are critical for activity. 

It has been ibund that a polypeptide comprising the central loop, preferably 
at least a portion of the N-terminal loop, is desirable for optimal biological activity. As 
5 used herein, the term "at least a portion of the N-terminal loop" is intended lo mean a 

sequence that corresponds lo (i.e., the same as or equivalent to), at least residues 15 to 18 ot 
Ihe N-terminal loop (residues 15-18 of SEQ ID NO:2), preferably at least residues 8 to 18 of 
the N-terminal loop (residues 8-18 of SEQ ID NO:2), and is also intended to include all of 
the N-terminal loop, i.e,, residues 1 lo 18 of the N-termmal loop (residues 1-18 of SEQ ID 
10 NO:2). The term is also intended lo mean a sequence thai corresponds lo al least 20%, 
nreferablv al least 60%. and more preferably at least 90%, of the contiguous or non- 
contiguous amino acid residues or their atoms selected from amino acid residues 1 lo 18 of 
the N-terminal loop (residues 1-18 of SEQ ID NO:2). 

II. Structure of mini-AGRP 

15 Anolher aspecl of the invention also pertains lo the idenlification of 

"minimal" polypeptides (mini-AGRPs) that show specific binding and/or activity al 
melanocortin receptors. show. Certain preferred mini-AGRPs are strong antagonists of 
melanocortin receptor types 3 and 4, and are useful for antagonizing melanocortin receptor 
in a variety of contexts, e.g. for the treatment of eating disorders and obesity. As indicated 

20 above, research has demonstrated that the final 46 residues of the human AGRP sequence 
("MARP"), possesses full receptor antagonist activity. The structure of MARP, solved by 
use of nuclear magnetic resonance and described herein, indicated that the final 13 residues 
of MARP are unstructured are unlikely to play a significant role m receptor binding or 
antagonism. Thus, substantial further minimization of human AGRP was achieved. 

25 Specifically, the sequence (indicated by the single letter amino acid code): 

CVRLH5ESCLGioQQVPCi5CDPAA:oTCYCR25FFNAF:,(,CYC (SEQ ID NO:3) designated 
"MARP-33" (based upon its 33 amino acid length) is believed to fold with proper native- 
like disulfide bonds and possesses the full biological activity of MARP. This sequence is 
equivalent to the first 33 amino acids of MARP but contain a Cys to Ala substitution at 

30 position 19 (indicated as "A") to avoid having a non-bridged thiol. 

Anolher mini-AGRP was made having the sequence: 
CVRLHESCLGQQVPCCDPAATCYCRPTNAFCYCR (SEQ ID NO:4). This sequence is 
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equivalent lo the first 34 amino acids of MARP. but contain a Cys to Ala substitution at 
position V) (indicated as "A") to avoid having a non-biidgcd thiol. The additional terminal 
residue (R) lacilitates solid state synthesis. This mini-AGRP was shown lo spccirically bind 
an anlagoni/c mclanocorlin receptors. 

In addition, the mini-ARGPs can be modilied to improve binding specificity, 
and/or to alter activity (f'.,^'. agoni.st, antagonist, compcilive inhibitor, etc. ). Such 
modifications include, but are not limited to changing the sequence to increase binding 
affinity, to increase the level of antagonism and to increase stability of the molecule. In 
particularly prefeired embodiments, the amino acid substitutions are made in the RFF 
residues (residues 25-27 of SEQ ID NO:3) and/or in the N-terminal loop (e.g. one or more 
of residues 2-7 and/or residues 9-14 of SEQ ID NO:3). 

In particularly prefened embodiments, mini-ARGPs are represented by 

formula I: 

CX'x'x'x'x'x'CX'x'x'x'°X-^x'"CCDPX"ATCYCX'^x''x''^-NAFCYCRn I 



where X\ X\ X\ X\ X\ X^ X^ X", X'", X", X'^ X'\ X>\ X'^ and X'" are 
independently selected amino acids (including natural, synthetic, or modified amino acids); 
and n is zero or one. In certain embodiments, in each of the varied domains, one or more of 
20 the native residues can be preserved. Thus, for example. X'xlxlX^X^X" includes, but is 
not limned to VX^X'^X-^X^X", X'RxWx", X'X^LX^X^X", X'X^X^HX^X", 
X'X^X'X'X-'^S, VRX^X^X-'X^ VX2LX'*X■^\^ VXlX^HXlx". VX'xlX^EX^ VX'x\x^X=S, 
X'RLX^X^X^ X'RX•m5x^ X'RX^X^EX", X'RX^X^X^S, X'X^LHX^X", X'X^LX-'X^X^ 
X'X'LX^EX^ X'x'LX'x'S, x•X'X-^HEX^ X'X^X^HX^S, X'X-X^'X^ES, VRLX^X^X^ 
25 VX-LHX-\X*', VRLHES and the like. Similar permutations are available for 

.\WX"'X"X'^ (e.K. LGQQVP, LX^X"'X'"X"X>1 X V,X"x">X"x'^ xlX«QX'"x"x'^ 
XVX'QX"X'-, XIX'X"X"VX'-, X"X'X"X'"X"R LGX'\\'"X"X'-, LX*^QX'"x"x'-. 
LxVqx"x'-. LX'x"x'"VX", LX'.X"x'"x"P, LGQX'*'x"x". and the like). 
Similarly, the "RFF" domain can be fully mutated or can retain one or more of the native 
residues' Thus, for example, X'^X'^X'" includes RFF. R'^X'-^X'^ X'^FX'^ X'^X'^F, 
RFX'*\ RX^'F, X''FF. In certain preferred embodiments, X is not cysteine. 
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The inini-AGRPs form a class of mclanocorlin receptor binding ancl/or 
modulaling agents and thus find utihty as modulators {c\i^. uprcgulators, dovvnrcgulators, 
competitive inhibitors) of melanocortin receptors, particular lVIC3r and/or MC4r. The 
compounds can be used individually or combined into a library suitable for screening for 
S members having particular activities, binding affinities, and the like. 

Certain mini-AGRPs are good potential therapeutics for the treatment of 
eating disorders and obesity, or can be used as therapeutic lead compounds for the 
development of therapeutics (e.g. as models for peptidomimetics, etc.). Where the 
compounds are used as therapetics, they can be administered to a patient (human or non- 
10 human mammal) in need thereof, to increase eating and fat deposition. 

Ill, Preparation of mini-AGRP s and mini-AGRP libraries. 

In certain embodiments, this invention provides libraries of mini-AGRPs. 
The libraries typically comprise a plurality of 

Preferred libraries comprise at least 20, preferably at least 50, more 
15 preferably at least 100, and most preferably at least 10,000, 50,000, 100,000, or even at least 
1,000,000 different members. 

Using the mini-AGRP sequence information provided herein, such libraries 
can be routinely prepared using methods well known to those of skill m the art. Such 
methods include, but are not limited to "traditional" chemical syntheses methods, light- 
20 directed chemical syntheses, and recombinant expression. 

Solid phase peptide synthesis in which the C-terminal amino acid of the 
sequence is attached to an insoluble support followed by sequential addition of the 
remaining amino acids in the sequence is the preferred method for preparing the peptide 
compounds of the present invention. Techniques for solid phase synthesis are well known t 
25 those of skill in the art (sec, e.g., Barany and Merrifield, Solid-Phase Peptide Synthesis: pp. 
3-284 in The Peptides: Analysis, Synthesis, Biology. Vol. 2: Special Methods in Peptide 
Synthesis, Part A., Merrifield, et al. J. Am. Chem. Soc. 85, 2149-2156 ( 1963), and Gross 
and Meienhofer, eds. Academic press, N.Y.. 1980 and Stewart et al.. Solid Phase Peptide 
Synthesis, 2nd ed. Pierce Chem. Co., Rockford, 111. (1984)). Solid phase synthesis is most 
30 easily accomplished with commercially available peptide synthesizers utilizing FMOC or 
TBOC chemistry. 
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In particularly preferred cmbodimcnls, peptide synthesis is pcrlormcd using 
F-moc synthesis. For example. AGRP C-lerminal portion (MARP) can be readily 
synthesized using techniques described in Yang, et al. supra. Each desired peptide can be 
individually synthesized. Where large numbers ofdilTcrcnt peptides arc desired (e.t^. for a 
5 library) combinatorial synthesis methods are available. Combinatoriai peptide synthesis can 
be performed according to the methods of Furka el al, (1991) Int. J. Pept. Protein Res. 37 
487-493. At the desired positions that are to be altered the synthesis resins are separated 
coupled with the desired amino acid(s) and then pooled for the next coupling reaction. Such 
methods are easily accomplished using multiple peptide synthesizers. 
10 After synthesis, the proteins are purified if necessary {e.g. via HPCL), re- 

folded and disulfide bonds are formed. Formation of intrapeptide disulfide bonds is often 
achieved by oxidation of the free thiol or sulfur-protected precursors under varying reaction 
conditions {see, e.g., Andreu et al (1994) Pages 91-169 In Peptide Synthesis Protocols; 
Pennington, M. W., Bunn. B. M., Ed.; Humana Press: New Jersey; Moroder et al (1996) 
15 Biopolymers 40: 207-234; Annis and Barany (1997) Meth. Enzymol, 289: 198-221 ; Tarn et 
al (1991)7. Am. Chetn. Soc. 111:. 6657-6662; Munson and Barany (1993) J. Am. Chem. 
Soc. 115: 10203-10216; Shik (1993) J. Org. Chem. 58: 3003-3008; Annis and Barany 
(1998) J. Am. Chem. Soc, 120: 7226-7238; Shi and Rabenstein (1999) J. Org. Chem. 64: 
4590-4595). Suitable oxidants include, but are not limited to iodine, thallium(III) 
20 tnfluroacetate (Fujii et al{l9ST) Chem. Pharm. Bull, 35: 2339-2347), and the like. 

Alternatively, recombinant expression methods can be used to produce the 
peptides of this invention. In this approach, a nucleic acid encoding the desired polypeptide 
(and optionally a purification tag, e.g. Hiso) is provided in an appropirale vector. A cell 
{e.g. E. coli. SF-3 cell, etc.) is transfected with the nucleic acid and, under appropriate 
25 conditions, transcribes and translates the desired protein. The protein is then recovered, re- 
folded, if necessary and cross-linked using standard methods well known to those of skill in 
the art. 

Molecular cloning techniques to achieve these ends arc known in the art. A 
wide variety of cloning and in vitro amplification methods are suitable for the construction 
30 of recombinant nucleic acids. Examples of these techniques and instructions sufficient to 
direct persons of skill through many cloning exercises are found, e.g., in Berger and 
Kimmel. (1989) Guide to Molecular Cloning Techniques, Methods in Enzymology volume 
152 Academic Press, Inc., San Diego. CA. Sambrook el al (1989) Molecular Cloning - .4 
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Laboratory Manual (2nd cd.) Vol. 1-3, Cold Spring Harbor Laboratory, Cold Spring Harbor 
Press, NY; and Ausubel et ai (1994) Current Protocols in Molecular Biolo^^w Current 
Protocols, a joint venture between Greene Publishing Associates, Inc. and John Wiley & 
Sons, Inc., 

5 IV. Lse of molecular models to design small molecules that bind the melanocortin 
receptor ligand binding site. 

One aspect of the invention involves methods for identifying and designing 
small molecules that bind to the ligand binding site using atomic models of MARP. In 
particular, the invention provides methodsof identifying a compound that modulates ligand 

10 binding to a melanocortin receptor. The method involves modeling test compounds that 
mimic or match the 3D conformation of MARP and therefore are expected to fit spatially 
into a melanocortin receptor ligand binding site of interest, using an atomic structural model 
of a melanocortin receptor binding region or portion thereof, preferably comprising the 
MARP receptor binding region or portion thereof. The test compounds can fit spatially into 

15 the ligand binding site of interest based upon a geometric fit of its three-dimensional 
structure or based upon the spatial arrangement of atoms presenting specific chemical 
properties such as charge and hydrophobicity. The test compounds are then screened in an 
assay, such as a biological assay, characterized by binding of a test compound to a 
melanocortin receptor ligand binding site, and identifying a test compound that modulates 

20 ligand binding to the melanocortin receptor. Details of the atomic structural model are 
described in detail herein. 

The atomic coordinates of MARP and its receptor binding region, provided 
herein, can be used for modeling to identify other compounds or fragments that bind 
melanocortin receptors. By "modeling" is intended quantitative and qualitative analysis of 

25 molecular structure/function based on atomic structural information and receptor-ligand 
agonists/antagonists interaction models. This includes conventional numeric-based 
molecular dynamic and energy minimization models, interactive computer graphic models, 
modified molecular mechanics models, distance geometry and other structure-based 
constraint models. Modeling is preferably perfonned using a computer and may be further 

30 optimized using known methods. By 'Tits spatially" is intended that the three-dimensional 
structure of a compound is accommodated geometrically by a cavity or pocket of a 
melanocortin receptor ligand binding site or by a pattern of charge or hydrophobicity. 
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Compounds of particular inlercst fit spatially and preferentially into the 
ligand binding site. By "fits spatially and preferentially" is intended that a compound 
possesses a three-dimensional structure and conformation for selectively interacting with a 
melanocortin receptor ligand binding site. Compounds that fit spatially and preferentially 
5 into the ligand binding site interact with amino acid residues forming the ligand binding 
site. More specifically, these compounds mimic or match the MARP receptor binding 
region. The present invention also includes a method for identifying a compound capable of 
selectively modulating ligand binding to different melanocortin receptors. The method 
compnses the steps of modeling test compounds that fit spatially and preferentially mto the 

10 ligand binding site of a melanocortm receptor of interest using an atomic structural model of 
o MARP, screening the test compounds in a biological assay for melanocortin receptor 
activity characterized by preferential binding of a test compound to the ligand binding site 
of a melanocortin receptor, and identifying a test compound that selectively modulates the 
activity of a melanocortin receptor. Such receptor-specific compounds are selected that 

1 5 exploit differences between the ligand binding sites of one type of receptor versus a second 
type of receptor. 

The invention also is applicable to generating new compounds that 
disnnguish melanocortin receptor isofonns. This can facilitate generation of either tissue- 
specific or function-specific compounds. 

20 The receptor-specific compounds of the invention preferably interact with 

conformationally constrained residues of the ligand binding site that are conserved among 
one type of receptor compared to a second type of receptor. "Conformationally 
constrained" is intended to refer to the three-dimensional structure of a chemical or moiety 
thereof having certain rotations about its bonds fixed by various local geometric and 

25 physical-chemical constraints. Conformationally constrained structural features of a ligand 
binding site include residues that have their natural flexible conformations fixed by various 
geometric and physical-chemical constraints, such as local backbone, local side chain, and 
topological constraints. These types of constraints are exploited to restrict positioning of 
atoms involved in receptor-ligand recognition and binding. 

30 As described in the Examples, residues 25, 26 and 27 of the MARP receptor 

binding region are shown to be critical for activity. These three residues, along with the 
overall structure and composition of the N-termmus and central loop, appear to be necessary 
for optimal biological activity. Accordingly, modification to these residues can be exploited 
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in the idcntil'ication and design of compounds that modulate ligand bmding to one 
mclanocortm rcccplor compared to another. 

1-or modeling, docking algorithms and computer programs that employ them 
can be used to identify compounds that match or mimic the MARP receptor binding region. 
5 For example, docking programs can be used to predict how a small molecule of interest can 
interact with the mclanocortin receptor ligand binding site. Fragment-based docking also 
can be used in building molecules de novo inside the ligand binding site, by placing 
chemical fragments that complement the site to optimize intermolecular interactions. The 
techniques can be used to optimize the geometry of the binding interactions. This design 
10 approach has been made possible by identification of the receptor binding region structure 
.u,.., r^rin'-ini'-s of rr.r,lf>r-ii1:>r rpcnanition can now be used to design a compound which 
matches the structure of this region. Compounds that structurally match or mimic the 
MARP receptor binding region serve as a starting point for an iterative design, synthesis and 
test cycle in which new compounds are selected and optimized for desired properties 
15 including affinity, efficacy, and selectivity. For example, the compounds can be subjected 
to addition modification, such as replacement and/or addition of R-group substituents of a 
core structure identified for a particular class of binding compounds, modeling and/or 
activity screening if desired, and then subjected to additional rounds of testing. 

Computationally small molecule databases can be screened for chemical 
20 entities or compounds that can bind in whole, or in part, to a melanocortin receptor ligand 
binding site of interest. In this screening, the quality of fit of such entities or compounds to 
the binding site may be judged either by shape complementarity (DesJalais et al., J. Med. 
Chem. (1988) 31:122-129) or by estimated interaction energy (Meng et al., J. Camp. Chem. 
(1992) 7.?:505-524). The molecule databases include any virtual or physical database, such 
25 as electronic and physical compound library databases, and are preferably used in 
developing compounds that modulate ligand binding. 

Compounds can be designed intelligently by exploiting available structural 
and functional information by gaining an understanding of the quantitative structure-activity 
relationship (QSAR). using that understanding to design new compound libraries, 
30 particularly focused libraries having chemical diversity of one or more particular groups of 
a core structure, and incorporating any structural data into that iterative design process. For 
example, one skilled in the art may use one of several methods to screen chemical entities or 
fragments to compare them to the 3D structure of the AGRP C-terminus and thus, their 

-27- 



4071-9809 !()US 

ability to associate with the ligand binding site of a mclanocortin receptor ot interest. This 
process may begin by visual inspection o\\ tor example, the receptor binding region on the 
computer screen. Selected fragments or chemical entities may then be positioned into all or 
part of the region. Docking may be accomplished using software such as Quanta and Sybyl, 
5 followed by energy minimization and molecular dynamics with standard molecular 
mechanics force-fields, such as CHARMM and AMBER. 

Residues comprising a receptor binding region can be modeled to look tor 
energetic contributions and interaction with the bound chemical entity. For example, a 
compound or fragment can be designed to contain hydrophobic groups that mteract with 
10 hydrophobic residues of the ligand binding site. 

Specialized computer programs may also assist in the process of selecting 
chemical entity fragments or whole compounds. These include: GRID (Goodford, J. Med. 
Chem. (1985) 2<S:849-857; available from Oxford University, Oxford, UK); MCSS 
(Miranker et al., Proteins: Structure, Function and Genetics, (1991) 77:29-34; available 
1 5 from Molecular Simulations, Burlington, MA); AUTODOCK (Goodsell et al., Proteins: 
Structure, Function and Genetics (1990) S: 195-202; available from Scripps Research 
Institute, La Jolla, CA); and DOCK (Kuntz et al, J. Mol. Biol. (1982) 767:269-288; 
available from University of Califomia, San Francisco, CA). 

Additional commercially available computer databases for small molecular 
20 compounds include Cambridge Structural Database and Fine Chemical Database (Rusinko, 
Chem. Des. Auto. News (1993) S:44-47). 

Once suitable chemical entities or fragments have been selected, they can be 
assembled into a single compound. Assembly may be proceeded by visual inspection of the 
relationship of the fragments to each other on the three-dimensional image displayed on a 
25 computer screen in relation to the structure coordinates of a mclanocortin receptor. This can 
be followed by manual model building using software such as Quanta or Sybyl. 

Useful programs to aid one of skill in the art in connecting the individual 
chemical entities or fragments include: CAVEAT (Bartlett et al.. "CAVEAT: A Program to 
Facilitate the Structure-Derived Design of Biologically Active Molecules", In: Molecular 
30 Recognition in Oiemical and Biological Problems", Special Pub., Royal Chem. Soc. (1989) 
78: 182-196: CAVEAT is available from the University of California. Berkeley. CA); 3D 
Database systems such as MACCS-3D (MDL Information Systems, San Leandro. CA; 
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reviewed in Martin, J. Med. Chem. (1992) .i^:2145-2154): and HOOK (available from 

Molecular Simulations, Burlington, MA). 

In addition to building a compound in a step-wise fashion, one fragment or 

chemical entity at a time as described above, compounds that bind to a ligand binding site of 
5 interest also may be designed as a whole or de novo using some poition(s) of the AGRP C- 

terminus, which is a molecule known to bind to the site. These methods include: LUDI 

(Bohm, / Comp. Aid, Molec. Design (1992) 6:61-78; LUDI is available from Biosym 

Technologies, San Diego, CA); LEGEND (Nishibata et al., Tetrahedron (1991) ^7:8985; 

LEGEND is available from Molecular Simulations, Burlington, MA); and LeapFrog 
10 (available from Tripos Associates, St. Louis, MO). 

Other molecular modeling techniques may also be employed in accordance 

with this invention. See, for example, Cohen et al., J. Med. Chem. (1990) ii:883-894); 

Navia et al., Ciirr. Opm. Struct. Biol. (1992) 2:202-210). For example, where the structures 

of test compounds are known, a model of the test compound may be superimposed over the 
15 model of the structure of the invention. Numerous methods and techniques are known in 

the art for performing this step, any of which may be used. See, for example, Farmer, 

''Drug Design:' Ariens, E.J., ed., 10:1 19-143 (Academic Press, New York, 1980); U.S. 

Patent No. 5,331,573; U.S. Patent No. 5,500,807; Verlinde, Structure, (1994) 2:577-587); 

and Kuntz et al., Science, (1992) 257: 1078-1082). The model building techniques and 
20 computer evaluation systems described herein are not a limitation on the present invention. 

Using these computer modeling systems a large number of compounds may 

be quickly and easily examined and expensive and lengthy biochemical testing avoided. 

Moreover, the need for actual synthesis of many compounds can be substantially reduced 

and/or effectively eliminated. 

25 V. AGRP peptidomimetics. 

Using the three-dimensional model of AGRP provided herein, a class of non- 
peptide melanocortin receptor ligands (peptidomimetics) were designed. The subject 
ligands are low molecular weight compounds that structurally mimic the AGRP active loop 
backbone. The subject non-peptide compounds find use in a variety of different 
30 applications, including the modulation of melanocortin receptor mediated physiological 
processes. 
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By ligands of mclanocorlm receptors (MCR) is meant that the subject non- 
pcptide compounds bind to melanocortin receptors. In many embodiments, the subject 
compounds preferentially bind to the folknving melanocortin receptors: MC3R, as described 
in U.S. Patent No. 5,837,521, the disclosure of which is herein incorporated by reference; 
5 and MC4R, as described in U.S. Patent No. 5,703,220, the disclosure of which is herein 
incorporated by reference. As the subject non-peptide compounds are iMCR ligands, the 
subject compounds bind to an MCR with an affinity corresponding to a Kd of about 50 mM 
or lower, preferably of about 5 mM or lower, more preferably of about 500 |iM or lower, 
and most preferably about 50 |LiM, 5 |aM, or 1 [xM or lower. In many embodiments, the 
10 affinity of the subject non-peptide compounds for an MCR, usually either MC3r or MC4r, 

r^,,^ ..U,^,,+ f^-o^ 'iKo«it inn ii\/T fr\ n l r\\/\ npH rnrirF' ii<iiinnv ilbnut 10 uM tO 

i an ii,e:5 1 1 l^i 1 1 u.uv.jui i ai 1 1:,^^.^ 1 1 v/i 1 1 (Aiyvji i fAi.> i w . i i , . i . ,.,.^..11-,..^. „i . . . -j^. 1 j.,. ^ . - 

0. 1 nM As the subject compounds are non-peptide compounds, they are protease resistant. 
By protease resistant is meant that the subject compounds are resistant to proteolytic 
cleavage, at least by the proteases described in Miller et ai (1994), Bioorg Med Cheni Let 4: 

15 2657-2662. The subject non-peptide compounds are low molecular weight compounds. By 
low molecular weight is meant that the subject compounds have a molecular weight that 
ranges from about 200 to 1 100 g/mol, usually from about 300 to 900 g/mol and more 
usually from about 400 to 800 g/mol. 

A feature of the subject non-peptide compounds is that they structurally 

20 mimic the active loop 3-D conformation when bound by the receptor. By active loop is 
meant residues 1 1 1-1 16 or Arg-Phe-Phe-Asn-Ala-Phe (SEQ ID NO:^) of the Agouti 
Related Protein. More specifically, the subject non-peptide compounds are characterized by 
substantially structurally mimicking the backbone phi angle of ammo acid 1 13 in AGRP, 

1. e. Phel 13 phi = -55.4*\ and the U1-U2 interatomic distance (see structure below). As the 

25 subject compounds substantially structurally mimic the active loop, in 9 of 10 lowest energy 
structures calculated with distance geometry the following requirements should be m.et. 

The phi angle mimicking amino acid 1 13 of AGRP, any deviation from the 
above angles should not exceed about -90" to -10** or 10" to 90", usually about -85" to -20" or 
20" to 85", and more usually about -80" to -30" or 30" to 80" in phi space. The interatomic 

30 Ui-U: distance should not exceed about 6.5 A, usually about 5.7 A, and more usually about 
about 5.5 A. 
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In many preferred embodiments, ihe subject ni)n-peptide ligands have the 
tormula ol Formula II: 




where: each B, U,, Lb-. Ri and R2 is independently selected from the group consisting of: 
hydrogen; alkyl; derivatized alkyl, e.g. halo alkyi, alkoxyalkyi, heteroalkyl, etc.; cycloalkyi; 

20 derivatized cycloalkyi, e.g. halocycloalkyl, aloxycycloalkyl, heterocycloalkyl; aryl; 
arylalkyl; heteroaryl; or heteroarylalkyl; J is carbon, nitrogen, silicon, or sulfur; X is 
hydrogen, carbon, nitrogen, oxygen, silicon, or sulfur; Z is a continuing peptide bond; 
hydroxyl; amide of the form -NH.-, -NH-(n) or -N-(n,n'), where n or n' can be any alkyl, 
alkenyl, alkynyl, cycloalkyi, aryl, arylalkyl, heteroaryl or heteroarylalkyl, or derivatized 

25 form thereof; or ester of the form -0-(y) where y can be any alkyl, alkenyl, alkynyl, 

cycloalkyi, aryl, arylalkyl, heteroaryl or heteroarylalkyl, or derivatized form thereof. In 
certain prefered embodiments, B comprises a heteroalkyl moiety, where in certain preferred 
embodiments the heteroalkyl moiety comprises a guanidino moiety. In certain preferred 
embodiments, each U is a cycloalkyi, preferably arylalkyl, and more preferably 

30 methylbenzyl. In many preferred embodiments, the subject compounds are compounds in 
which B, R. Ri and R2 are as described above. 

Specific non-pcptide compounds of interest include the compounds of 
Formulas III, IV, V, and VI. shown below: 



35 
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The subject compounds may be synthesized using any convenient protocol. 
In one representative protocol, the peptoid portion of the subject molecules is synthesized 
using a submonomer approach, where a representative submonomcr synthesis protocol that 
may be adapted to synthesize the subject compounds is described in U.S. Patent Nos. 
5,977301 and 5,831,005; the disclosures of which are herem incorporated by reference. 
Following production of the peptoid portion, the terminal '^B" group as described above is 
added to the terminus of the peptoid portion of the compound. A representative sheme for 
preparation of the subject compounds is provided in the Experimental Section, infrd. 



V II. Uses of melanocortin receptor liiiands. 

A) Use as melanocortin receptor probes. 

The melanocortin receptor ligands of this invention are useful in a wide 
variety of contexts. Because the ligands specifically bind the melanocortin receptor, they 
can be used as probes to specifically detect/localize melanocortin receptors (in vivo, in vit 
m cell culture, etc, ). Thus, the ligands can be used to detect the presence or absence or to 
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quanlity ihc expression level of a melanocorlin reeeplor. in such applications, the rcccptor 
ligands are preferably labeled with a detectable label. 

Detectable labels suitable tor use in the present invention include any 
composition detectable by spectroscopic, photochemical, biochemical, immunochemical, 
electrical, optical or chemical means. Uselul labels in the present invention include, but arc 
not limited to biotin for staining with labeled slreptavidin conjugate, magnetic beads (^^,[,^, 
DynabeadsTM), fluorescent dyes (e.i,^.. fluorescein, texas red, rhodamme), fluorescent 
proteins (e.g., green fluorescent protein (GFP), red fluorescent protein (RFP), and the like), 
radiolabels (e.g., 'H, '^'I, "'S, *"^C, or "'P), enzymes (e.g., horse radish peroxidase, alkaline 
phosphatase and others commonly used in an ELISA), chemiluminescent labels (e.g. 
luciferins), and colorimetric labels such as colloidal gold (e.g., gold particles in the 40 -80 
nm diameter size range scatter green light with high efficiency) or colored glass or plastic 
(e.g., polystyrene, polypropylene, latex, etc.) beads. Other suitable labels include spin 
labels, e.g., provided by reporter molecules with an unpaired electron spin which can be 
detected by electron spin resonance (ESR) spectroscopy. Exemplary spin labels include 
organic free radicals, transitional metal complexes, particularly vanadium, copper, iron, and 
manganese, and the like. 

It will be recognized that fluorescent labels are not to be limited to single 
species organic molecules, but include inorganic molecules, multi-molecular mixtures of 
organic and/'or inorganic molecules, crystals, heteropolymers, and the like. Thus, for 
example, CdSe-CdS core-shell nanocrystals enclosed in a silica shell can be easily 
derivatized for coupling to a biological molecule (Bruchez et al. (1998) Science, 281: 2013- 
2016). Similarly, highly fluorescent quantum dots (zinc sulfide-capped cadmium selenide) 
have been covalently coupled to biomolecules for use in ultrasensitive biological detection 
(Wan-en and Nie (1998) Science, 281: 2016-2018). 

Patents teaching the use of labels include U.S. Patent Nos. 3,817,837; 3,850,752; 3,939,350 
3,996,345; 4,277,437; 4,275,149; and 4,366,241 . 

The label may be added to the melanocortin receptor ligand prior to, or after 
the ligand is contacted to the sample. So called "direct labels" are detectable labels that are 
directly attached to or incoiporated into the ligand prior application of the ligand to the 
sample. In contrast, so called "indirect labels" are joined to the bound ligand after it has 
contacted the sample. Often, the indirect label becomes attached to a binding moiety 
present on the ligand before it is contacted to the sample. Thus, for example, the ligand 
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may be biolinylaled before it is used in the assay. After the iigand is bound to melanocortin 
rcceptor(s) in the sample, an aMdin-con|ugaled fluorophore will bind the biotin on the 
Iigand thereby providing a label that is easily detected. 

The label is attached to the Iigand directly or through a linker moiety, in 
5 certain embodiments, the label can be conveniently attached to an amino or carboxyl 

terminus of the Iigand or to the R group of any amino acid(s) comprising the Iigand as long 
as it does not interfere with specific binding of the Iigand. 

The label is detected using a method appropriate to the nature of the label. 
Thus, for example, enzymatic labels are detected by providing the appropriate substrate and 
10 reaction conditions for the enzyme and detecting loss of substrate or increase of reaction 

product. Radioactive labels are detected, e.f^. via scintillography. Fluorescent labels and/or 
coiorimetnc labels are delected using optical methods (e.g. fluorometry, image analysis, 
etc). 

Use in altering melanocortin re ceptor activity. 

1 5 In certain preferred embodiments, the melanocortin receptor ligands of this 

invention are used to alter (modulate) melanocortin receptor activity. The Hgand can act as 
a simple competitive inhibitor blocking access to the receptor by its native Iigand. 
Alternatively the ligands of this invention can act as receptor agonists or antagonists. 

The melanocortin receptor can be modulated in vivo, or in vitro. Of 
20 particular interest is use of such compounds in a method of modulating melanocortin 

receptor activity in a mammal by administering to a mammal in need thereof a sufficient 
amount of a compound that fits spatially and preferentially into a Iigand binding site of a 
melanocortin receptor of interest. By "modulating" is intended increasing or decreasing 
activity of a melanocortin receptor. 
25 The subject compounds (e.g. mini-AGRPs and/or non-peptide ligands) find 

use in a variety of different applications, and are particularly suited for use in modulating 
MC3R and/or MC4R mediated physiological processess. As such, the subject compounds 
find use in modulating feeding behaviour, and treating disease conditions associated with 
feeding behaviour. For example. MCR agonists or AGRP binding antagonists of the subject 
30 methods find use inhibiting appetite, where such compounds find use in the treatment of 
disease conditions associated with excessive caloric intake, e.g. obesity and the like. 
Conversely, .MCR antagonists of the subject invention find use in treating disease 
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coiidilions where increasing appetite is attractive. anorexia, cachexia and the hkc. Other 
disease conditions in which the subject compounds may find use include those described in: 
WO 99/2 1 .5 1 7: WO 98/ 1 ()()68; WO 99/,S4358-. WO 99/3 1 .508: WO 99/43709: WO 
99/55679; and WO 99/57148. 
5 Other representative uses tor the subject iigands, c.t,'. screening assays for 

MCR iigands, etc., arc also disclosed in these listed patent applications, the disclosures ot 
which are herein incorporated by reference). Labeled, i.e. radio, fluorescent, biotin, antigen, 
etc., non-peptide Iigands of melanocortin receptors find use as biological tracers for receptor 
identification in vivo. 

10 The compounds of the subject invention find use in a variety of different 

hosts. Generally such hosts are "mammals" or "mammalian," where these terms are used 
broadly to describe organisms which arc within the class mammalia, including the orders 
carnivore (e.g., dogs and cats), rodentia (e.g., mice, guinea pigs, and rats), and primates 
(e.g., humans, chimpanzees, and monkeys). In many embodiments, the hosts will be 

15 humans. 

The compouds of this invention are also useful as "lead compounds" for the 
development of therapeutics. For example, pre-clinical candidate compounds can be tested 
in appropnate animal models in order to measure efficacy, ab,sorption, pharmacokinetics 
and toxicity following standard techniques known in the art. Compounds exhibiting desired 
20 properties are then tested in clinical tnals for use in treatment of various melanocortin 

receptor-based disorders. These include feeding disorders, including wasting syndromes, 
obesity, and other disorders related to hypothalamic control of feeding. A wasting 
syndrome is an illness characterized by significant weight loss accompanied by other indicia 
of poor health, including poor appetite, gut disorder, or increased metabolic rate. Wasting 
25 syndromes include, but are not limited to, the wasting syndrome afflicting some patients 
diagnosed with .Acquired Immune Deficiency Syndrome (AIDS) and various cancers. As 
methods of treating other symptoms of diseases such as AIDS progress, the incidence of 
wasting syndrome as the cause of death increases. Improved prophylaxis and treatment for 
HIV wasting syndrome is required (Kravick, et al.. Arch, liucrn. Med. 157:2069-2073, 
30 1997). Anorexia and cachexia are well-known results of cancer that contribute to morbidity 
and mortality (Simons, et al. Cancer 82:553-560. 1998: Andrassy & Chwals. Nutrition 
14:124-129. 1998). The reasons for the significant weight loss are multiple and may be 
directly related to the tumor, such as increased metabolic rate, but also include decreased 
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intake due lo poor appetite or gut involvement. F\irther, exeessivc leptin-iike signaling may 
contribute to the pathogenesis of uasting illness (Schwartz, et al.. Pro, Nutr. Sac. 36:78.^- 
791, 1997). 

Accordingly, one aspect oi the invention pertains to a method of treating a 
5 disease state in mammals that is alleviated by treatment with a polypeptide having an amines 
acid sequence: CX'x'X-'X^X^'x'rX^X'x'x''X^ 'x'^CCDPX^^'ATCYCX^^X'^X^'N AFC 
YCRn (SEQ ID NO:_), wherein X\ X\ X\ X\ X\ X^ X^ X^ X^ X'^ X'\ X'\ X'\ X'\ 
X^\ and n is 0 or 1, which method comprises administering to a mammal in need of such a 
treatment a therapeutically effective amount of the polypeptide, which can be administered, 

10 by way of illustration and not limitation, in a liquid formulations or a solid formulations, 
such as in the form of a pharmaceutically acceptable salt thereof. In one prefeired 
embodimiCnt, the polypeptide has the amino acid sequence: CVRLIIESCI .GQQVPCC 
DPAATCYCRFFNAFCYC (SEQ ID NO:3). In certain embodiments, such a disease state 
can be a wasting syndrome. 

15 There are many other uses and advantages provided by the present invention. 

For example, the methods and compositions described herein are useful for identifying 
peptides, peptidomimetics or small natural or synthetic organic molecules that modulate 
melanocortin receptor activity. The compounds are useful in treating melanocortin 
receptor-based disorders. Methods and compositions of the invention also find use in 

20 characterizing structure/function relationships of natural and synthetic ligand compounds 

VIII. Screening of peptidomimetics and/or mini-AGRPs. 

A) Binding and activity assays. 

The compounds of this invention (e.g. mini-AGRPs or peptidomimetics) or 
libraries thereof can be screened to identify those having a particular receptor specificity, 

25 and/or avidity, and/or activity (e.g. agonist, antagonist, etc.). Such screening methods can 
readily identify "lead compounds" displaying the desired properties. Once lead compounds 
are identified, a variety of homologs and analogs can be prepared, e.g. to facilitate an 
understanding of the relationship between chemical structure and biological or functional 
activity. These studies define structure activity relationships which are then used to design 

30 drugs with improved potency, selectivity and pharmacokinetic properties. Combinatorial 
chemistry can also be used to rapidly generate a variety of structures for lead optimization. 
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In prereiTcd embodiments, such assays include binding assays and/or activity 
assays. Binding assays typically measure the ability of the ligand(s) of interest to bind a 
melanocorlin receptor (c^,l,^ MC3r, MC4r, etc.). The assays can be qualitative (positive or 
negative) or quantitative, c.i^. provide a measure of binding affinity. 
5 Suitable binding assays are well known to those of skill in the art. In a 

preferred embodiment, the binding assay is a competitive assay that measures the ability of 
the test composition to displace/compete a reference ligand from a melanocortin receptor. 
In one preferred approach specific binding of ligands of this invention to cells expressing 
the MC-3 receptor is determined by competition experiments using labeled (e.g. ^'^^l- 
10 labeled) Nle'^-D-Phe^-a-MSH (NDP-MSH), as described m Tatro et al (1990) Cancer Res. 
50: 1237-1242). 

In various embodiments, screening can be in vitro and^or in vivo. Certain 
preferred assays include cell-free competition assays and cell culture based assays. 

Activity assays measure the ability of the compounds of this invention to 

15 activate (agonize) or inhibit (antagonize) activity at one or more melanocortin receptors or 
to block or augment the activity of known antagonists or antagonists. Activity assays for 
melanocortin receptors are well known to those of skill in the art. 

Preferred assays measure, directly, or indirrectly, melanocortin receptor 
induced changes in intracellular cAMP concentrations. One such assay measures the ability 

20 of cAMP to displace (8-'H) cAMP from a high affinity cAMP binding protein (see Gilman 
(1979) Proc\ Natl Acad. Sci„ USA, 67: 305-312). Briefly, test cells are exposed to the 
ligands to be screened. Following treatment, the cells are washed twice with phosphate 
buffered saline and intracellular cAMP extracted by lysing the cells with 1 mL of 60% 
ethanol. The assay is then run as described by Gilman, supra.). 

25 Another convenient assay system is described in U.S. patent 6,100,048. This 

patent discloses recombinant expression constructs comprising nucleic acid encoding 
mammalian melanocortin receptors, and mammalian cells into which the recombinant 
expression constructs have been introduced that express functional mammalian 
melanocortin receptors. A panel is provided of such transformed mammalian cells 

30 expressing melanocortin receptors for screening compounds for receptor agonist and 

antagonist activity. One typical panel includes MClr, MC2r, MC3r, and MC4r. In various 
embodiments, each of the cells of the panel of mammalian cells expressing mammalian 
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mclanocortin rcccpH)rs further comprises a recombinant expression construct encodmg a 
cyclic AMP responsive element (CRE) transcription factor binding site that is operatively 
linked to a nucleic acid sequence encoding a protein capable of producing a delectable 
metabolite. Expression of the protein that produces the detectable metabolite is dependent 
5 nn binding of the lest compound to the mclanocortin receptor expressed by each cell in the 
panel and the intracellular production of cAMP as a result. Detection of the metabolite 
thereby provides a measure of the agonist or antagonist activity of the ligand(s) in question. 

In a preferred embodiment, compounds of the invention bind to a 
mclanocortin receptor ligand binding site with greater affinity than the native cellular ligand 
10 proteins. Preferred compounds show at least a 1.25 fold greater affinity, preferably at least 
a !.5 fold greater affinity, more preferably at least a 2 fold greater affinity, and most 
preferably at least a 5 fold or iO fold greater affinity than the native cellular ligand proteins. 

Preferred compounds can additionally or alternatively, show a different 
receptor specificity, e.g. specificity for MC4r rather than specificity for MC3r and MC4r, 
15 etc. The compounds selected can have agonist and/or antagonistic properties. The 

compounds also include those that exhibit new properties with varying mixtures of agonist 
and antagonist activities, depending on the effects of altering ligand binding in the context 
of different activities of mclanocortin receptors which are mediated by proteins other than 
ligands, and which interact with the receptors at locations other than the ligand binding site. 
20 The compounds also include those, which through their binding to receptor locations that 
are conformationally sensitive to ligand binding, have allosteric effects on the receptor by 
stabilizing or destabilizing the ligand-bound conformation of the receptor, or by directly 
inducing the same, similar, or different conformational changes induced in the receptor by 
ligand binding. 

25 The assays described herein are meant to be illustrative and not limiting. 

Using the teaching provided herein, other suitable assays will be apparent to those of skill in 
the art. 

In certain prefeired embodiments, the compounds that s 

B) High throughput screening 

30 The activity or binding assays of this invention are also amenable to "high- 

throughput" modalities. As described above preferred ligands of this invention bind to 
mclanocortin receptors and thereby modulate receptor activity. Preferred assays detect MC 
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receptor binding and/or activity. High throughput assays for the presence, absence, or 
quanlification of particular products arc well known lo those of skill in the art. Similarly, 
binding assays are similarly well known. Thus, lor example. U.S. Patent .5,.S59,4I0 and 
.'S, 585, 639 discloses high throughput binding assays. 
5 In addition, high throughput screening systems are commercially available 

(see, e.M-, Zymark Corp., Hopkinton, MA; Air Technical Industries, Mentor. OH: Beckman 
Instruments, Inc. Fullerton, CA; Precision Systems, Inc., Natick, MA, etc.). These systems 
typically automate entire procedures including all sample and reagent pipetting, liquid 
dispensing, limed incubations, and final readings of the microplate in detector(s) appropriate 
10 for the assay. These configurable systems provide high throughput and rapid start up as 
well as a high degree of flexibility and customization. The manufacturers of such systems 
provide detailed protocols the various high throughput. Thus, for example, Zymark Corp. 
provides technical bulletins describing screening systems for detecting the modulation of 
gene transcription, ligand binding, and the like. 
15 It is noted that HTS can be efficiently accomplished by screening 

combinations of ligands in a single assay. Thus, for example, a 680 well microtitre plate 
could contain 100 different ligands per well providing 68,000 different ligands per plate. 
Those wells that show a positive score for a particular assay are then sequentially 
subsampled to identify the particular ligand(s) providing the positive signal. 

20 C) Databases of melanocortin recepto r modulators. 

In certain embodiments, the agents (ligands) that score positively in the 
assays described herein {e.g. show an ability to bind and/or modulate a melanocortin 
receptor) can be entered into a database of putauve and/or actual modulators of Socs2 
expression. Alternatively or additionally, a specificity fingerprint of a putative or acctual 
25 modulator can be entered into the database. 

The term database refers to a means for recording and retrieving information. 
In prefcned embodiments, the database also provides means for sorting and/or searching the 
stored information. The database can comprise any convenient media including, but not 
limited to. paper systems, card systems, mechanical systems, electronic systems, optical 
30 systems, magnetic systems or combinations thereof. Prefcned databases include electronic 
ie.i^. computer-based) databases. Computer systems for use in storage and manipulation of 
databases are well known to those of skill m the art and include, but are not limited to 
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"personal computer systems", mainframe systems, distributed nodes on an mter- or intra- 
net, data or databases stored in specialized hardware ie.i^. in microchips), and the like. 

IX. Methods for identifying receptor bindinu reuions of melanocortin receptor 
agonist and antagonists. 

The invention also includes compositions and methods for identifying 
receptor binding regions of melanocortin receptor agonist and antagonists, along with ligand 
binding sites of melanocoitin receptors. The methods involve examining the surface of a 
polypeptide of interest to identify residues that modulate ligand binding. The residues can 
be identified by homology to the receptor binding region of MARP described herein. A 
preferred method is alignment with the residues of any polypeptide corresponding to (i.e., 
the same as or equivalent to) residues 1-18 of the N-terminal loop (residues 1-18 of SEQ ID 
NO:2), residues 19-34 of the central loop (residues 19-34 of SEQ ID NO:2) and residues 
35-46 of the C-terminal loop (residues 35-46 of SEQ ID NO:2). Overlays and 
superpositioning with a three-dimensional model of the MARP receptor binding region, or a 
portion thereof that contains a receptor binding region, also can be used for this purpose. 
For example, melanocortin receptor agonists and antagonists identifiable by homology 
alignment include naturally occurring compounds or compounds structurally related to such 
naturally occurring compounds found in humans, along with synthetic compounds. 

Alignment and/or modeling also can be used as a guide for the placement of 
mutations on the receptor binding region surface to characterize the nature of the ligand 
binding site on melanocortin receptors in the context of a cell. To destroy the ligand 
binding interaction, prefen-ed mutations are to charged residues (e.g., Arg, Lys, or Glu) on 
the basis that bulky, surface charged residues might disrupt ligand binding, yet preserve the 
overall ligand structure and solubility. Mutants can be tested for ligand binding as well as 
the relative change in strength of the binding interaction. Ligand-dependent ligand 
interaction assays also can be tested for this puipose, such as those described herein. 

Compounds that bind to the ligand binding site of melanocortin receptors can 
be identified by computational modeling and/or screening. For example, ligand agonists or 
antagonists can be identified by providing atomic coordinates comprising the MARP 
receptor binding region or portion thereof to a computerized modeling system, modeling 
them, and identifying compounds that mimic or match the receptor binding region and thus 
would be expected to fit spatially into the ligand binding site. By a "portion thercot" is 



-41- 



4{)7T-98(Wl()rS 

intended the atomic coordinates corresponding to a sutTicicnt number of residues or their 
atoms of the receptor binding region that interact w ith a melanocortin receptor capable of 
binding the region. As another example, an atomic structural model utilized tor 
computational modeling and/or screening of compounds that mimic or match the receptor 

5 binding region and thus would be expected to fit spatially into the ligand binding site, may 
include a portion of atomic coordmates of amino acid residues corresponding to the region 
composed of residues 24-3 1 of the central loop (residues 24-3 1 of SEQ ID NO:2), or their 
structural and functional equivalents. An atomic model can also be designed that includes 
residues 19-34 of the central loop (residues 19-34 of SEQ ID NO:2), and a model can 

10 further be designed that includes residues 19-34 of the central loop (residues 19-34 of SEQ 
ID NO:2) and some or all of residues 1-18 of the N-terminal loop (residues 1-18 of SEQ ID 
NO:2). Thus, for example, the atomic coordinates provided to the modeling system can 
contain atoms of MARP, all or part of the receptor bmding region or a subset of atoms 
useful in the modelmg and design of compounds that mimic or match the receptor binding 

15 region. 

X. Pharmaceutical Preparations 

Also provided are pharmaceutical preparations of the subject non-peptide 
compounds. The subject compounds can be incorporated into a variety of formulations for 
therapeutic administration. More particularly, the compounds of the present invention can 

20 be formulated into pharmaceutical compositions by combination with appropriate, 

pharmaceutically acceptable carriers or diluents, and may be formulated into preparations m 
solid, semi-solid, liquid or gaseous forms, such as tablets, capsules, powders, granules, 
ointments, solutions, suppositories, injections, inhalants and aerosols. The formulations may 
be designed for administration via a number of different routes, including oral, buccal, 

25 rectal, parenteral, intraperitoneal, intradermal, transdermal, mtracheal, etc., administration. 

In pharmaceutical dosage forms, the compounds may be administered in the 
form of their pharmaceutically acceptable salts, or they may also be used alone or in 
appropriate association, as well as in combination, with other pharmaceutically active 
compounds. The following methods and excipients are merely exemplary and are in no way 

30 limiting. 

For oral preparations, the compounds can be used alone or in combination 
with appropriate additives to make tablets, powders, granules or capsules, tor examiple, with 
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conventional addilives, such as lactose, mannitoK corn starch or potato starch; with binders, 
such as crystalline cellulose, cellulose deruativcs. acacia, corn starch or gelatins; with 
disintegrators, such as corn starch, potato starch or sodium carboxymeth ylccllulosc; with 
lubricants, such as talc or magnesium stearate; and if desired, with diluents, buffering 
agents, moistening agents, preservatives and flavoring agents. 

The compounds can be formulated inlo preparations for injection by 
dissolving, suspending or emulsifying them in an aqueous or nonaqueous solvent, such as 
vegetable or other similar oils, synthetic aliphatic acid glycerides, esters of higher aliphatic 
acids or propylene glycol; and if desired, with conventional additives such as solubilizers, 
isotonic agents, suspending agents, emulsifying agents, stabilizers and preservatives. 

The compounds can be utilized in aerosol formulation to be administered via 
inhalation. The compounds of the present invention can be formulated into pressurized 
acceptable propellants such as dichlorodifluoromethane, propane, nitrogen and the like. 

Furthermore, the compounds can be made into suppositories by mixing with 
a variety of bases such as emulsifying bases or water-soluble bases. The compounds of the 
present invention can be administered rectally via a suppository. The suppository can 
include vehicles such as cocoa butter, carbowaxes and polyethylene glycols, which melt at 
body temperature, yet are solidified at room temperature. 

Unlike typical peptide formulations, the peptides of this invention 
comprising D-form amino acids can be administered, even orally, without protection against 
proteolysis by stomach acid, etc. Nevertheless, in certain embodiments, peptide delivery can 
be enhanced by the use of protective excipients. This is typically accomplished either by 
complexing the polypeptide with a composition to render it resistant to acidic and enzymatic 
hydrolysis or by packaging the polypeptide in an appropriately resistant carrier such as a 
liposome. Means of protecting polypeptides for oral delivery are well known in the art (see, 
e.^., U.S. Patent 5,391,377 describing lipid compositions for oral delivery of therapeutic 
agents). 

Where the ligands of this invention are polypeptides for oral administration, 
it is desirable to protect the polypeptide from digestion. This is readily accomplished by a 
variety of encapsulation technologies. One prefen^ed encapsulation system is a "sustained- 
release system". 

Such sustained release systems are well known to those of skill in the art and 
can often maintain elevated scrum half-life. In one preterrcd embodiment, the ProLease 
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biodegradable microsphere delivery system lor protcms and peptides (Tracy (1998) 
BioU'clmoL Proii. 14; 108; Johnson et al, (1996) Naimr Med. 2: 795: Herbert et al. (1998). 
Plumnaccut. Res. 15, 3.57) a dry powder composed of biodegradable polymeric 
microspheres containing the protcm in a polymer matrix that can be compounded as a dry 
5 formulation with or without other agents. 

The ProLeasc microsphere fabrication process was specifically designed to 
achieve a high protein encapsulation efficiency while maintaining protein integrity. The 
process consists of (i.) preparation of freeze-dried protein particles from bulk protein by 
spray freeze-drying the drug solution with stabilizing excipients, (ii) preparation of a drug- 
10 polymer suspension followed by sonication or homogenization to reduce the drug particle 
size, (iii) production of frozen drug-polymer microspheres by atomization into liquid 
mtrogen, (iv) extraction of the polymer solvent with ethanol, and (v) filtration and vacuum 
drying to produce the final dry-powder product. The resulting powder contains the solid 
form of the protein, which is homogeneously and rigidly dispersed within porous polymer 
1 5 particles. The polymer most commonly used in the process, poly(lactide-co-glycolide) 
(PLG), is both biocompatible and biodegradable. 

Encapsulation can be achieved at low temperatures {e.g., -40"C). During 
encapsulation, the protein is maintained in the solid state in the absence of water, thus 
minimizing water-induced conformational mobility of the protein, preventing protein 
20 degradation reactions that include water as a reactant, and avoiding organic-aqueous 

interfaces where proteins may undergo denaturat.on. A preferred process uses solvents in 
which most proteins are insoluble, thus yielding high encapsulation efficiencies (e.g., 
greater than 95%). 

Unit dosage forms for oral or rectal administration such as syrups, elixirs. 

25 and suspensions may be provided wherein each dosage unit, for example, teaspoonful, 

tablespoonful, tablet or suppository, contains a predetermined amount of the composition 
containing one or more inhibitors. Similarly, unit dosage forms for injection or intravenous 
administration may comprise the inhib.tor(s) in a composition as a solution in sterile water, 
normal saline or another pharmaceutically acceptable carrier. 

30 The term "unit dosage fonn.'- as used herein, refers to physically discrete 

units suitable as unitary dosages for human and animal subjects, each unit containing a 
predetermined quantity of compounds of the present invention calculated in an amount 
sufficient to produce the desired effect in association with a pharmaceutically acceptable 
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dikicnt. carrier or vehicle. The speciticalions for the novel unit dosage forms of the present 
mvenlion depend on the particular compound employed and the effect to be achieved, and 
the pharmacodynamics associated with each compound in the host. 

The pharmaccutically acceptable excipicnts. such as vehicles, adjuvants. 
5 carriers or diluents, arc readily available to the public. Moreover, pharmaceutically 
acceptable auxiliary substances, such as pH adjusting and buffering agents, tonicity 
adjusting agents, stabilizers, wetting agents and the like, are readily available to the public. 

In therapeutic applications, the compositions of this invention are 
administered to a patient suffering from one or more symptoms of a wasting syndromes, 
10 obesity, and other disorders related to hypothalamic control of feeding. Such conditions 
include, but are not limited to wasting syndrome accompanying AIDS and various cancers. 
The compounds are administered in an amount sufficient to cure or at least partially prevent 
or arrest one or more symptoms of the the disease and/or its complications. An amount 
adequate to accomplish this is defined as a "therapeutically effective dose." Amounts 
15 effective for this use will depend upon the severity of the disease and the general state of the 
patient's health. Single or multiple administrations of the compositions may be 
administered depending on the dosage and frequency as required and tolerated by the 
patient. In any event, the composition should provide a sufficient quantity of the active 
agents of the formulations of this invention to effectively treat (ameliorate one or more 

20 symptoms) the patient. 

The concentration of ligand (polypeptide or peptiodmimetic) can vary 
widely, and will be selected pnmanly based on fluid volumes, viscosities, body weight and 
the like in accordance with the particular mode of administration selected and the patient's 
needs. Concentrations, however, will typically be selected to provide dosages ranging from 

2.S about 0. 1 mg/kg/day to about 1000 mg/kg/day. preferably from about 1 mg/kg/day to about 
100 mg/kg/day. more preferably from about 5 mg/kg/day to about 50 mg/kg/day. It will be 
appreciated that such dosages may be varied to optimize a therapeutic regimen in a 
particular subject or group of subjects. 
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XI. Kits. 

In still anolher embodiment, this invention provides kits for practice of the 
methods described hcrem. In certain embodiments the kits comprise a container containing 
one or more of the melanocortin Hgands of this invention. The ligands can be labeled or 
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unlabeled, and/oi\ optionally provided in a unit dosage form, and/or optionally, provided 
with or in a pharmacological acceptable excipient. The kits can optionally additionally 
include one or more detectable labels Tor labeling the ligand(s). 

Certain preferred kits provide libraries of the ligands of this invention as 
5 described herein. 

The kits can optionally include any reagents and/or apparatus to facilitate 
practice of the methods described herein. Such reagents and apparatus include, but are not 
limited to buffers, instrumentation, devices for administering the ligand(s) {e,g. syringes, 
etc.), microliter plates, labeling reagents streptavidin or biotin conjugated substrates, PAGE 

10 gels, blotting membranes, reagents for detecting a signal, and the like. 

In addition, the kits may include instructional materials containing directions 
{i.e., protocols) for the practice of the methods of this invention. Preferred instructional 
materials provide protocols for utilizing the kit contents for modulating melanocortin 
receptor activity and/or for screening for particular melanocortin receptors, and/or for 

15 treating a disease or pathological state. While the instructional materials typically comprise 
written or printed materials they are not limited to such. Any medium capable of storing 
such instructions and communicating them to an end user is contemplated by this invention. 
Such media include, but are not limited to electronic storage media (e.g., magnetic discs, 
tapes, cartridges, chips), optical media (e.g., CD ROM), and the like. Such media may 

20 include addresses to internet sites that provide such instrucdonal materials. 

XIL Computer-based methods. 

The invention also provides for the NMR structure of the human AGRP C- 
terminus, preferably embodied in a computer readable form. Synthesis of MARP and 
concentrated solutions adequate for NMR spectroscopy are described in the examples. 

25 After the NMR structure of MARP is determined, the structural information can be used in 
computational methods to design synthetic agonist and antagonist compounds for the 
melanocortin receptors, and further structure-activity relationships can be determined 
through routine testing using the assays described herein and known in the art. It is 
expected that the structure coordinates of the NMR structure of MARP, as provided in 

30 Tables 4 and 5, will be pailiculaiiy useful for solving the NMR structure of other 

melanocortin receptor antagonists or agonists. The coordinates in Table 4 and Table 5 
represent the minimized average of the coordinates of a family ot NMR structures. The 
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m.nimi/.cd average was determined by ( I ) calculating an average from approximately 20 
NMR structures lound to meet the experimental NMR distance restraints and (2) subjecting 
this calculated average to a final round of energy minimization. 

One method that may be employed for this purpose is molecular 
replaccm.ent. In this method, the unknown NMR structure, may be determined using the 
structure coordinates of this invention as provided in Table 4 and Table 5. The coordinates 
in Table 4 for the human AGRP C-terminus have been deposited with the Brookhaven 
National Laboratory Protein Data Bank, and have been assigned Brookhaven Protein Data 
Bank Accession Number lqu8. The coordinates for Table 5 have been deposited with the 
Brookhaven National Laboratory Protein Data Bank, and have been assigned Brookhaven 
Protein Data Bank Accession Number IHYK. This method will provide an accurate 
structural form for the unknown NMR structure more quickly and efficiently than 
attempting to determine such information ab initio. 

One aspect of the invention is an NMR structure of the minimized agouti 
related protein, embodied in a computer readable media. Atomic coordinate information 
gleaned from the NMR structure of the invention is preferably stored and provided in the 
form of a machine-readable data storage medium. This medium contains information for 
constructing and/or manipulating an atomic model of a receptor binding region or portion 
thereof. Fo^r example, the machine readable data for the receptor binding region may 
compnse structure coordinates of amino acids corresponding to {i.e.. the same as or 
equivalent to) residues 1-18 of the N-terminal loop, (residues 1-18 of SEQ ID NO:2), and/or 
residues 19-34 of the central loop (residues 19-34 of SEQ ID NO:2) and/or residues 35-46 
of the C-terminal loop (residues 35-46 of SEQ ID NO:2), or a homologue of the molecule or 
molecular complex comprising the region. The machine readable data for the receptor 
binding region may compnse structure coordinates of amino acids corresponding to residues 
118 of the N-term,na! loop, (residues 1-18 of SEQ ID NO:2) and residues 19-34 of the 
central loop (residues 19-34 of SEQ ID NO:2). The machine readable data may also 
compnse residues 24-31 of the central loop (residues 24-31 of SEQ ID NO:2) and a portion 
of the N-terminal loop, for example residues 15 to 18 (residues 15-18 of SEQ ID NO:2), 
residues7to 18 (residues 7-18 of SEQ ID NO;2), or residues 1 to 18(residues l-18ofSEQ 
ID NO:2). The homologues compnse a receptor binding region that has a root mean square 
deviation from the backbone atoms of the amino acids of not more than 2.54A. preferably 
not more than 1.66 A. 
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The machine-readable data storage medium can be used lor mtcractive drug 
design and molecular replacement studies. For example, a data storage material is encoded 
with a first set of machine-readable data that can he c(Mnbined with a second set of machine- 
readable data. For molecular replacement, the first set of data can comprise a Fourier 
transform of at least a portion of the structural coordinates of the AGRP C-terminus or 
portion thereof of interest, and the second data set comprises an X-ray diffraction pattern o\ 
a molecule such as the melanocortin receptor of interest. Using a machine programmed 
with instructions for using the first and second data sets a portion or all of the structure 
coordinates corresponding to the second data can be determined. 

EXAMPLES 

The following examples are offered to illustrate, hut not to limit the claimed 

invention. 

Example 1 

Chemical synthesis and characterization of MARP 



Abbreviations 

AGRP/ART 

Conformational shift 

DQF-COSY 

HX 

ICK 

MARP 

MC3r/MC4r 

MRE 

a-MSH 

NDP-MSH 

nOe 

NOESY 
TOCSY 



three bond aH-NH scalar coupling constant 
Agouti Related Protein/Agouti Related Transcript 
experimental chemical shift-random coil chemical shift 
two-dimensional double-quantum filtered correlation spectroscopy 
hydrogen-deuterium exchange 
Inhibitor Cystine Knot 

N4inimized human Agouti Related Protein, residues 87-132 of human 
AGRP 

Melanocortin Receptor % 
Mean Residue EUipticity 
cx-Melanocyte Stimulating Hormone 

[Nle^ D-Phe']a-MSH, a superpotent melanocortin agonist 
melanocortin Overhauser enhancement 
two-dimensional melanocortin Overhauser spectroscopy 
t\vo-dim.ens!onal total con-elation spectroscopy 
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Methods and Materials 



A) Chemical protein synthesis: 

//'-Acctyl-MARP (SEQ ID NO:2) and yV"-iicctyl-MARP(Arg25Ala) were 
synthesized, folded and purified to each give a protein eontaining five disulfide bonds, as 
5 reported in Yang, et ai., supra. 

B) CD experiments: 

Circular dichroism ("CD") spectra were recorded at 25"C on an Aviv 60DS 
spectropolarimeter in a rectangular 1 mm path length cuvet for concentrations up to 60 ^M. 
For concentrations higher than this a round cell with a 0.1 mm path length was used. All 
10 CD samples were 50 mM potassium phosphate, pH 4.25. Concentration dependence was 
ruled out in the range 20 |^M-1 mM. Temperature dependence was determined for 5-85"C. 
The spectra are superimposable from 5 to 45"C. 

C) NMR samnle preparation: 

The activity of MARP used for the NMR sample and that of a single mutant 
1 5 were assayed by measuring the inhibition of cAMP production in the presence of NDP- 

MSH (Shutter, el al., supra; Iluszar, ct al., supra) in HEK-293 cells transfected with human 
MC4r. Control experiments were performed with no MARP. The NMR samples were 
found to be native-like with complete activity. NMR samples contained approximately 1.9 
mM MARP at pH 4.2 in 50 mM KH2PO4 buffer in 90% H2O/10% D.O. Additional samples 
20 for HX experiments were prepared by lyophilization of protonated samples followed by 
reconstitution in 700 |J.l D2O. 

NMR experiments: 

'H 2D NMR spectra were principally acquired at 15"C on a Varian 500 Unity 
Plus .spectrometer using inverse probes. Initial NMR data were routinely acquired with a 
6000 Hz spectral width, 4096 complex points in /:, and 512 (TOCSY/DQF-COSY) or 700 
(NOESY) increments in //. All spectra were processed using the MNMR package 
(Carlsberg Laboratory, Department of Chemistry, Denmark) and analyzed using XEASY 
(Bands, et al.. J. Biomol. NMR 5:1-10 ( 1995)), with chemical shifts referenced to 1.4- 
dioxime al 3.743 ppm. Sequential assignments of all backbone and >m of side chain 



25 
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protons were accomplished us.ng standard methods (Wuthr.ch. K. "NMR ol Protems and 
Nucleic Adds" (John Wiley and Sons. Ncu York, 1986): Redf.eld. C. in "NMR of 
Macromolecules: A Practical Approach, pp. 71-99 (Roberts, G.K.C., Ed.. IRL Press at 
Oxford University Press. Oxford. 199.1)) for 50 ms TOCSY, 150 ms NOESY and DQF- 
5 COSY data. Additional data sets were acquired at 25"C and 3()"C to resolve ambiguities. 
Examination of the three Pro residues identified nOes consistent only with trans-Pro. The 
same experiments were repeated at 800 mHz. 

Four additional peaks were identified in the aN region of the TOCSY 
spectrum, however, associated spin systems could not be identified and neither could nOes 

10 to the peaks in question. 

NOESY data for distance restraints were collected at 15"C using the WET 
sequence (Smallcombe, et al., ./. Ma,n. Reson. Ser. A 1 17:295-303 (1995)) for water 
suppression, 1 .6 s recycle delay and a mixing time of 80 ms in both H^O and D^O. 
^HNaCouphng constants at 25°C were determined by both linear least squares fitting of the 
15 antiphase doublets m a DQF-COSY and also usmg the INFIT (Szypersk,, et al., J. Magn. 
Reson. 99:552-560 (1992)) module of XEASY with 150 ms NOESY data. These methods 
agreed to withm ±0.5 Hz for all of the measured coupling constants. At 15"C larger 
intrinsic linewidths precluded accurate measurement of ' ^HNa. 

For amide exchange experiments, the magnet was preshimmed on a 21 
20 residue peptide sample at pH 4 in D.O/phosphate buffer. The first TOCSY expenment was 
begun 23 minutes after reconstituting the protonated sample in D.O. Four TOCSY 
experiments identical to those described above, except for the number of increments, were 
acquired back to back at 15°C over a period of 24 hours. The first three consisted of 150 t, 
.ncrements and the final expenment 300 increments. NOESY and DQF-COSY spectra 
25 were also acquired as described above. 

F.. Structure calculations: 

Final structure calculations included the covalent connectivity of the 
published disulfide map (Bures. et al.. supra), and were based on a total of 414 inteipro.on 
distance constraints derived from the 80 ms 2D NOESY spectra and 34 backbone O 
30 dihedral anele constraints derived from coupling constant measurements, giving a total of 

448 total restraints, or 9.7 restraints per residue. The distance restraints can be broken down 
into 228 intraresidue (backbone to side chain only). 129 sequential. 20 medium range ( K,/- 
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/,<5) and 37 long range (,/-/,>?) rcstramts. These restraints were assigned as strong, 
medium or weak. Tiie total numbers of restramts m each category were 95 strong. 246 
medium, and 77 weak. All categories had a lower limit of 1 .6A. with upper limits of 2.8, 
3.5 and 5 A for the strong, medium and weak categories, respectively. Trial structures were 

5 generated using the simulated annealing protocol from CNS version 0.4a (anncal.inp) with 
SUM averaging for the nOc distances (Brunger, ct al., Acta Cry.slallogr. Sect. D Biol. 
Cry'-stallogr. 54:905-921 (1998); Nilges. et al., FEBS Lett. 239: 1 29- 136 (1988); Nilges, ct 
al., Protein En^. 2:27-38 ( 1988); Kuszewski, et al., J. Magn. Reson. Ser. B 1 12:79-81 
(1996); Stein, et al., J. Magn. Reson. 124:154-164 (1997)). Twenty structures with no bond 

10 or nOe angle violations were used to represent the solution structure of MARP (see Table 1 
for Root Mean Square Deviations. "RMSDs" ). Structures were displayed using MOLMOL 
(Koradi, et al., Science 278: 135-13S (1997)). 

Results and Discussion. 

The synthesis and biochemical charactenzation of MARP were previously 
15 reported in Yang, et al., supra. The N-terminal residue (Cys-1) of MARP corresponds to 
the first Cys (Cys-87) of the Cys-nch region in full length, 132 residue human AGRP. The 
material used for the MARP NMR sample showed native-like activity as measured by its 
ability to competitively inhibit NDP-MSH at MC4r, as has been shown in previous studies 
(Yang et al., supra; Ollmann, et al., Sctence 278:135-138 (1997)). Previous mutational 
20 studies of agouti and AGRP showed residues Arg-25, Phe-26, Phe-27 (the RFF triplet, Tota, 
et al supra) and Asp-17 (in agouti) to be determinants of receptor binding (Kiefer, et al., 
Bwchentistry 36:2084-2090 (1997); Kiefer. et al., Biochennstry 37:991-997 (1998); Tota, et 
al., supra)). 

This experiment measured inhibition of NDP-MSH stimulated cAMP 
05 eeneration in cells transfected with MC4r. A control experiment was conducted with no 

MARP present. The addition of MARP (5.0 x lO " M) demonstrated competitive mhibit.on 
of NDP-MSH. The addition of MARP with Arg-25 to Ala substitution (5.0 x 10 ^" M) 
demonstrated loss of inhibUion due to a mutation in the active loop. This experiment thus 
showed that the replacement of Arg-25 by Ala results in a complete loss of inhibitory 
30 activity. 

The far-UV CD spectrum of MARP was measured in 50 mM phosphate 
buffer at pi I 4.25 as a function of temperature with spectra shown every 5=C from 5=C to 
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30°C and also at 45°C. The speclium was similar lo that reported tor a similar C-lerminal 
tragmcnl (residues 8S-132) of AGRP (residues 85-132 of SEQ ID NO: I ) reported by 
Rosenfeld. et al.. supra, and .s characteri/.ed by a negative maximum at approximately 198 
nm and a slight negative maximum at 245 nm. There was little indieation of canonical a- 

5 helix, however the spectrum did suggest s.^me p character or turns. The ncar-UV CD 
spectrum, indicative of tertiary structure, showed a weak minimum at approximately 275 
nm which could be attributed to the disulfide bonds and possibly restncted orientations of 
the side chains of the Tyr-23 and Tyr-32. 

At 25"C between 20 \xM and 1.0 mM there was no observable concentration 

0 dependence as measured by CD. Between 1 mM and 1 .9 mM there were no concentration 

hifts. etc. all remain 
dependent changes in tne NMR spectra w.t . ,ii.^vvi^i..s, l,u...i-.<i 

constant). The far-UV CD spectrum remained constant between 5"C and 45"C, a 

temperature range well beyond that of the present NMR experiments. Thus, by all 

indications, MARP existed as a monomer and did not exhibit temperature dependence under 

15 the conditions of the NMR expenments. The characteristics of the NMR spectra were 

indicative of a well folded protein with a single predominant conformer. "VHNa coupling 

constants and temperature coefficient also indicated a fully folded, non-random coil 

confonner as shown in Table 1 below, which shows the MARP coupling constants at 25"C 

and NH temperature coefficients in 50 mm phosphate buffer at pH 4.2. 

90 Table 1. Vhn« coupling constants an d temperature coefficient s. _ 



Cys-1 

Val-2 

Arg-3 

Leu-4 

His-5 

Glu-6 

Scr-7 

Cvs-8 



8.79 
7.00 
5.64 
7.50 
9.34 

9.92 



Leu-9 -^-^O 
Glv-10 



Gln-1 1 
Gln-12 
Vai-13 



8.14 
8.00 
9.03 



6.93 

3.63 

6.37 

3.47 

2.58 

2.90 

5.86 

3.84 

2.43 

7.06 

4.94 

4,38 

5.56 



4()7T-'-)8()91()l'S 



-^^^^ WcHM^lTn^^umT(H/.) NH tempcralmc coctT.c.cnl 

(ppb/ C) 



Thr-38 



9.64 



Pro- 14 
Cys-15 
Cys-16 
Asp- 17 
Pro- 18 
Cys-19 
Ala-20 
Thr-21 

Cys-22 5-76 
Tyr-23 

rv<;-24 

Arg-25 ^"^^ 
Phe-26 ^"^^ 
Phe-27 3.68 

7.82 



Asn-28 

Ala-29 ''■^^ 

Phe-30 "^-5^ 

Cys-31 9.17 

Tyr-32 9.49 
Cys-33 
Arg-34 



9.34 
7.25 



Ala-39 4.48 
Met-40 '^■^^ 



Asn-41 
Pro-42 
Cvs-43 

Ser-44 ^.26 
Arg-45 
rhr-46 



8.19 

7.24 
7.26 
7.54 



().()() 

4.33 ^'-74 

5.42 ^■^'^^ 

4.81 

0.00 

8.81 3.84 



3.23 



9.36 4.04 

7.56 
2.01 
3.14 
2.93 
3.67 
6.21 
5.66 
2.10 
4.84 
5.84 
4.43 
1.13 
2.74 
8.80 



7 74 6.43 
^--^^ 7.00 

8.07 2.07 



7.67 
2.98 
1.08 
0.00 
7.86 
8.38 
4.47 
4.61 
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Example 2 
Structural description of MARI* 

The m.nimi/cd average NMR stiucluie MARP is shown in Figure 1. 
Consistent with the lar-LI V CD spectrum. MARP showed l.tllc evidence of helical or 

, , , ,....,...,,„-„ Tb» disu'fidf^ honds H- 16. 8-22. 15-33, 19-43 and 24-31 

reaular sheet ^5econdal> ^.tiUcidi>^. 1 1.^ uisu.i .u^ .,v.nj 

appeared to form a scaffold upon which the structure was apportioned into three major 
loops, which are referred to as the N-termma! loop (residues 1-18), the central loop 
(residues 19-34 ) and the C-terminal loop (residues 35-46). and are indicated in Figure 1. 
RMSDs for the individual loops are reported in Table 2 below: 

lable 2. Nummary oi ivlAKl' uat^Kbonu a..^ u^^^ j ^ > .,..1-.. ^ 
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r~ , R.4.-ihnnp RMSD* (A) Heavv atom RMSD" ( A) 

Region (residues) Backbone K.vt;^^ . 

Global (1-46) 2.54 3.26 

N- and active loops (1-34) 1-66 

N-terminal loop (1-18) 1-31 ^-^^ 

Central loop ( 19-34) 151 
Active loop (24-31) 0-69 

C-terminal loop (35-46) 2.36 3. - 



^15^mined by fitting the family of 20 NMR structures lo ttie minimized average structure 

The N-terminal and central loops weie much better defined both within the 
loops and w,th respect to each other than the C-termtnal loop. The backbone RMSD for the 
entire protein (2.54A) was of the same order as that of the C-terminal loop (2.36A), whtle 
the backbone RMSD for residues 1-34 (1.66A) was of the order of the individual N-terminal 
and central loops. To demonstrate limited backbone structure variability of the N-terminal 
and central loops, a superposition of 14 structures (selected for clarity) for residues 1-34 and 
the MARP minimized average structure (residues 1-46) is shown in Figure 2. 

Four of the five disulfide bonds are located at the base of the structure where 
they appear to pinch together the bottoms of the loops to form the "core" of the protein 
(Figure 1 ). The exception is disulfide bond 24-31 which stabilizes the central loop. The 
central loop, residues 19-34. contains the RFF triplet determined to be critical lor activity. 
This motif IS situated within an even smaller, well defined loop bound by Cys-24 and Cys- 
3 1 which IS referred to as the "active" loop. The side chain atoms of the RFF triplet 
residues are located at the surface of the protein as depicted m Figure 3. Recent 
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cxpcnmcnls further highlight the importance of this aeti vc loop. These studies demonstrate 
that short cyclic peptides corresponding to residues 24-3 I of human AGRP do in fact 
antagoni/,c MC3r and MC4r (Tt)la. el al.. sitpni). 

inspection of the family of NMR structures and consideration of the 
observed HX revealed a structure for the central loop that is best described as an inegular 
hairpin w.th a well defined loop from Cys-24 to Cys-31 (RMSD ().6A, Figure 3) and a stem 
region which is both twisted around and curved along its --axis (Figure 1). This 
charactenzation was supported by critical examination of the nOe, V„s. and chemical shift 
data. As shown in Figure 3, the active loop ,s highly constrained with the RFF triplet side 
chains exposed to solvent. Arg-25 and Phe-27 point out into the solvent, while one face of 
the Phe-26 aromatic ring rests parallel against the surface of the protein. Though the active 
loop satisfies several of the determinants for an Q-loop (Leszczynski, et al. (1986) Science 
234:849-855), the side chain orientation of Arg-25 and Phe-27 precludes its definition as 
such since Q-loop side chains generally pack within the loop of backbone atoms. 

HX experiments demonstrated that the amide protons of residues Cys-8, Ala- 
20, Thr-1, Tyr-23, Tyr-2, Cys-3 and Arg-34 are protected from exchange with solvent. To 
explore whether these results were consistent with the average structure, the program DSSP 
(Kabsch, et al., Biopolymers 22:2577-2637 (1983)) was used to identify potential hydrogen 
bonds. DSSP Identified the backbone amides of Ala-20, Thr-21, Tyr-23, and Arg-34 as 
20 potential hydrogen bond donors. In addition, solvent accessible surface area calculations 
showed that residues Cys-8 and Cys-33 were completely buned from solvent, though in the 
D.O spectrum the aN crosspeaks of these two residues overlap, thus their individual 
protection from exchange is uncertain. Tyr-32 had only %^?c solvent accessible surface area 
at the Csprotons. 

23 The NMR structure (Table 4) gave a well resolved fold, however, as 

mentioned previously, canonical helices and P-sheets were not identified on the basis of 
nOes or other protocols including the chemical shift index (Wishart, et al., R,nchcnustrv 
31: 1647-1651 ( 1992)) or V„n„ coupling constants (Wuthrich. K.. supra). 

The guidelines for these protocols assigned secondary structure on the basis 
of four or more consecutive residues with similar conformational shifts or Vhn«. Helical 
structure was characterized by V„x.<6 Hz and negative conformational shifts and (3-sheet 
bv V„N.>8 Hz and positive conformational shifts. Even in the active loop and stem region 
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of the central loop, no regular secondary structure was idcntiiicd by these criteria. 
However, the chemical shift index pointed towards a possible extended strand from residue 
31 to residue 3.5. Table 3 sets torth the Ml chemical shifts at 1 5"C. 'i^u.- at 25(rC. and the 
NH temperature coefficients. 

Table 3. MARP 'H chemical shifts at 15"C in 50 mm phosphate buffer at pH 4.2 

Residue NH(ppm) H^ipEm) HfiiijEni) Qtherj^Eml 

-^^l 8:00 500" 312:2:81 



Vai 2 8 9^ 4.18 1.94,2.08 CH'3 0.91 , 0.80 

Ara 3 891 4.01 1.67,1.95 .69, 1.82, H^3.24, 3.26, 

^ ■ NH7.31 

Leu-4 
His-5 
Glu-6 
Ser-7 
Cys-8 
Leu-9 
Gly-10 
Gin- 1 1 
Gin- 12 
Val- 13 
Pro- 14 
Cys-15 
Cys-16 
Asp- 17 
Pro- 18 
Cys-19 
Ala-20 
Thr-21 
Cys-22 
Tyr-23 
Cys-24 
Arg-25 
Phc-26 
Phe-27 
Asn-28 
Ala- 29 



8. 13 


3.72 


i .3-+, 1 .J^ 


H''1.03, CH^0.79 


0. J J 


4.25 


3.70, 1.42 


H'^7.22, "'8.51 


ft Hzl 
0 .UH- 


4.66 


2.16 


H"'2.24 






3.84 




O.Uo 


4.96 


3.57, 3.13 


H-q.44, CH^O.82 




4 08 


1.57, 1.44 


8. /o 






H"' 2.28. H'^6.86, 7.49 


O.UO 


4.24 


1.95 


0 .Do 


4.27 


2.20, 1.95 


H'' 2.30, 2.34, H'" 6.82, 7.45 


7.48 


4.47 


2.05 


CHS 0.79, 0.90 


0.0 i 


4.55 
4.92 


2.25, 1.99 
3.35, 1.74 


HM.84, 1.99, 3.65, 3.77 


9.58 


4.22 


2.62, 3.17 




8.19 


4.78 


2.63, 2.41 


H"-' 1.98, H'"* 3.89, 4.04 


8.88 


4.50 


2.35 




4.72 


2.94, 3.47 




8.01 


4.94 


1.26 




8.76 


4.58 


4.00 


CH'3 I 16 


8.94 


4.58 


2.94, 3.02 


H'^ 6.93, H' 6.74 


8.71 


4.62 


2.79 


8.30 


4.91 


3.23, 2.60 




8.29 


3.84 


1.83. 1.55 


Hq.56, 1.23, H'\3.06 


7.91 


4.74 


2.80, 3.32 


H^7.26, 7.42 


8.54 


4.19 


3.12, 3.02 


7. 17. 7.33 


8.48 


4.20 


2.39. 7.77 


\t- 6.66, 7.30 


7.75 


4.38 


1.25 
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• 
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Residue 


NH (ppm) 


H,, (ppm) 


(ppm) 


Other (ppm) 


Phc-3() 


8.38 


4.21 


3.32, 3.36 




Cys-31 


8.34 


5.63 


2.59, 3.03 




Tyr-32 


8.88 


5.20 


2.59, 2.8 1 


11 6.92, i i o.oo 


Cys-33 


8.19 


4.96 


3.21, 2.66 


IlM.57, 1.69, H^' 2.62, 2.88, 


Arg-34 


9.43 


4.75 


1.83, 1.70 










NH 7.08 


Lys-35 


9.09 


4.47 


1.82, 1.68 


H"^! .25, 1-43, H 1.64, H i.»9 


Leu-36 


8.77 


4.3640 


1.63 


H 1 .52, Cri 3 U.o2, U. /u 


Gly-37 


8.48 


4.09,4.01 




CH\3 1.21 


Thr-38 


7.82 


4.42 


4.42 


Ala-39 


8.53 


4.16 


1.42 




Mct-40 


8.06 


4.40 


1.95, 2,09 


H'2.51, 2.62 


Asn-41 


7.74 


5.00 


2.62, 2.81 


H 7.67, 6.95 


Pro-42 


8.68 


4.47 


2.29 


H4.94, H i.ol, ^^.o / 


Cys-43 




4.59 


3.22, J. I J 




Ser-44 


8.37 


4.46 


3.86 




Arg-45 


8.26 


4.43 


1.93 


T tY 1 A LJ^ ^ 0 1 KFM 7 ^ S 


Thr-46 


/ .oo 


4 1 5 


4,23 


CHSl.15 



Table 4. MARP atomic coordinates 



in 3 dimensional space determine by NMR at 500 mHz. 



ATOM 

ATOM 

ATOM 

ATOM 

.ATOM 

ATOM 

ATOM 

.ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 



1 

2 

3 

4 

5 

6 

7 

8 

9 

10 

I 1 

12 

13 

14 

15 

16 

17 



CA 

HA 

HBl 

HB2 

C 

O 

CB 

SG 

N 

HTl 

HT2 
HT3 

N 

HN 

CA 
HA 
CB 



CYS 
CYS 
CYS 
CYS 
CYS 
CYS 
CYS 
CYS 
C^ S 

c^s 
c^s 

CYS 

VAL 

VAL 

VAL 

VAL 

VAL 



2 
2 



-6.900 


2.269 


-8.250 


1.00 


4.62 


-5.897 


2.041 


-7.920 


1.00 


4.59 


-7.160 


4.143 


-9.269 


LOO 


5.13 


-6.230 


4.254 


-7.780 


LOO 


4.63 


-7.900 


1.623 


-7.293 


LOO 


3.97 


-9.067 


1.425 


-7.636 


1.00 


4.16 


-7.088 


3.796 


-8.249 


1.00 


4.78 


-8.576 


4,376 


-7.358 


1.00 


5.12 


-7.069 


1.766 


9.642 


1 .00 


5.38 


-8.026 


2.021 


-9.958 


1 .00 


5.84 


-6.942 


0.734 


-9.624 


1.00 


5.52 


-6.346 


-) 


-10.234 


1.00 


5.55 


-7.445 


1.356 


-6.070 


1.00 


3.32 


-6.517 


1.593 


-5.848 


LOO 


3.27 


-8.315 


0.801 


-5.028 


1.00 


2.84 


-9.187 


0.372 


-5.505 


LOO 


3.1 1 


-7.624 


-0.292 


-4.151 


1.00 


2.58 



-t-W / i - 'oV/ 7 1 \/ O 


0 










• 




ATOM 


18 


HB 


VAL 




-6.927 


0. 197 


'J 1 (in 


1 nn 

1 .VJVJ 


ATOM 


19 


CGI 


VAL 




-8.637 


- 1 .028 




1 .WVJ 


ATOM 


20 


HGl 1 


VAL 




-9.533 


-1 .215 






ATOM 


21 


HGl 2 


VAL 




-8.879 


4). 42 / 




1 Of) 

1 Afy) 


ATOM 




HGl 3 


VAL 


1 


-8.214 


- 1 .969 




1 Ofl 

1 .\)\) 


ATOM 




CG2 


VAL 




-6.851 


- 1 .2v> 


-.1 .UUU 


1 on 


ATOM 


24 


HG21 


VAL 


2 


-6.240 


-1 .922 




1 or) 


ATOM 


25 


HG22 


VAL 


2 


-6.214 


-U.7dd 




1 on 


ATOM 


26 


HG23 


VAL 


2 


-7.544 


-1.914 


-5.54o 


1 nn 
1 .uu 


ATOM 


27 


C 


VAL 


2 


-8.750 


1 .96 / 


A 1 
-4. 1 43 


1 00 
1 


ATOM 


28 


O 


VAL 


2 


-8.214 


3.0/1 


-4.ZO J 


\ 00 


ATOM 


29 


N 


ARG 


3 


-9.694 


1.759 


- J.Zj»o 


1 00 

1 .v/VJ 


ATOM 


30 


HN 


ARC 


3 


-10.100 


0.873 


-J. Kio 


1 00 


ATOM 


.1 1 


CA 


ARG 


3 


-10.120 


2.85 1 


J* oU 


1 00 


ATOM 


32 


HA 


ARG 


3 


-10.214 


o T 1 O 


-Z. Wo 


1 00 

i .\J\J 


ATOM 


33 


CB 


ARG 


3 


-11. 466 


2.J>39 


- 1 . / zu 


1 00 


ATOM 


34 


HBl 


ARG 


3 


-1 1.575 


3.15/ 


-U. OH 1 


1 00 


ATOM 


35 


HB2 


ARG 


3 


-1 1.477 


1 CA 1 

1.501 


1 All 


1 00 


ATOM 


36 


CG 


ARG 


3 


-12.662 


2.785 


-Z.OZo 


1 00 


ATOM 


37 


HGl 


ARG 


3 


-13.287 


1 O A C 

1 .90:) 


-Z.OZo 


1 00 


ATOM 


38 


HG2 


ARG 


3 


-12,307 


2.9 /o 


-J) .O-jW 


1 00 


ATOM 


39 


CD 


ARG 


3 


-13.486 


3.9 /o 




1 .00 


ATOM 


40 


HDl 


ARG 


3 


-13.348 


4. /oo 


-Z.oD J 


1 00 


ATOM 


41 


HD2 


ARG 


3 


-13.136 


A O < 


-1.1 


1 00 


ATOM 


42 


NE 


ARG 


3 


-14.913 


3.0D2 


-Z,\J 1 o 


1.00 


ATOM 


43 


HE 


ARG 


3 


-15.185 


2.745 




1 00 

1 . \J\J 


ATOM 


44 


CZ 


ARG 


3 


-15.856 


A C /I A 

4.540 


- 1 . / ZD 


1 00 


ATOM 


45 


NHl 


ARG 


3 


-15.532 


5.79D 


- 1 H-Z-? 


1 00 


ATOM 


46 


HHll 


ARG 


3 


-14.578 


AO 1 

D.09 1 


1 ASQ 

- 1 .H J >' 


1.00 


ATOM 


47 


HH12 


ARG 


3 


-16.246 


6.446 


-1.1 DZ 


1 00 


ATOM 


48 


NH2 


ARG 


3 


17.129 


A 1 /^A 

4 . 1 oU 


- 1 .u / / 


1.00 


ATOM 


49 


HH21 


ARG 


3 


-17.379 


3.219 


1 OO"^ 
- 1 . fyJS) 


1 .00 


ATOM 


50 


HH22 


ARG 


3 


-17.837 


4.8 lo 


- i .4 1 J> 


1 00 


ATOM 


51 


C 


ARG 


3 


-9.053 


O 1 AT 

3. 103 


1 


1 00 


ATOM 


52 


O 


ARG 


3 


-8.519 


1 


-vj. / o 


1 .00 


ATOM 


53 


N 


LEU 


4 


-8.725 


4.372 


1111 

-1.1 1 1 


1 00 


ATOM 


54 


HN 


LEU 


4 


-9. 167 


C AC^ 

5.08- 


1 A 1 S 

- 1 . D 1 J 


1 .00 


ATOM 


55 


CA 


LEU 


4 


-7.708 


4. / 2d 


-\ } . 1 H-U 


1 .00 


ATOM 


56 


HA 


LEU 


4 


-6.84 / 


1 1 1^ 


{) 37 i 


1 .00 



2.54 

2.26 

2.67 

2.47 

2.24 

3.27 

3.49 

3.65 

3.65 

2.69 

3.12 

2.64 

2.81 

2.90 

3.25 

3.25 

3.44 

3.38 

3.73 

4.00 

3.91 

4.23 

4.60 

4.30 

4.74 

4.90 

5.32 

5.50 

5.28 

6.06 

6.01 

6.19 

6.50 

2.75 

2.50 

3.24 

3.68 

3.40 

3.11 



4()7T-y8()'>»!()l S 














ATOM 


57 


CB 


LEU 


4 


-7.351 


6.196 


-0.264 


ATOM 


58 


HBl 


LEU 


4 


-7.379 


6.479 


-1.304 


ATOM 


59 


HB2 


LEU 


4 


-6.353 


6.339 


0.1 10 


ATOM 


60 


CG 


LEU 


4 


-8.285 


7.097 


0.500 


ATOM 


6! 


HG 


LEU 


4 


-8.344 


6.741 


1 .509 


ATOM 


62 


GDI 


LEUI 


4 


7.773 


8.520 


0.517 


ATOM 


63 


HDll 


LEU 


4 


-7.232 


8.714 


-0.396 


ATOM 


64 


HD12 


LEU 


4 


-7.113 


8.647 


1.363 


ATOM 


65 


HD13 


LEU 


4 


-8.605 


9.203 


0.599 


ATOM 


66 


CD2 


LEU 


4 


-9.659 


6.991 


-0.103 


ATOM 


67 


HD21 


LEU 


4 


-10.392 


7.382 


0.584 


ATOM 


68 


HD22 


LEU 


4 


-9.857 


5.945 


-0.295 


ATOM 


69 


HD23 


LEU 


4 


-9.690 


7.541 


-1.030 


ATOM 


70 


C 


LEU 


4 


-8.186 


4.418 


1.274 


ATOM 


71 


O 


LEU 


4 


-9.386 


4.241 


1.507 


ATOM 


72 


N 


HIS 


5 


-7.244 


4.346 


2.217 


ATOM 


73 


HN 


HIS 


5 


-6.311 


4.491 


1.965 


ATOM 


74 


CA 


HIS 


5 


-7.564 


4.044 


3.618 


ATOM 


75 


HA 


HIS 


5 


-6.629 


3.939 


4.150 


ATOM 


76 


CB 


HIS 


5 


-8.361 


5.193 


4.240 


ATOM 


77 


HBl 


HIS 


5 


-8.651 


4.923 


5.245 


ATOM 


78 


HB2 


HIS 


5 


-9.248 


5.373 


3.650 


ATOM 


79 


CG 


HIS 


5 


-7.583 


6.467 


4.309 


ATOM 


80 


NDl 


HIS 


5 


-8.108 


7.657 


4.768 


ATOM 


81 


HDl 


HIS 


5 


-9.021 


7.792 


5.094 


ATOM 


82 


CD2 


HIS 


5 


-6.300 


6.730 


3.966 


ATOM 


83 


HD2 


HIS 


5 


-5.586 


6.017 


3.564 


ATOM 


84 


CEl 


HIS 


5 


-7.182 


8.597 


4.702 


ATOM 


85 


HEl 


HIS 


5 


-7.307 


9.629 


4.993 


ATOM 


86 


NE2 


HIS 


5 


-6.076 


8.060 


4.221 


A'lOM 


87 


HE2 


HIS 


5 


-5.204 


8.506 


4.184 


ATOM 


88 


C 


HIS 


5 


-8.338 


2.726 


3.736 


ATOM 


89 


O 


HIS 


5 


-8.921 


2.423 


4.779 


ATOM 


90 


N 


GLU 


6 


-8.318 


1 .949 


2.656 


ATOM 


91 


HN 


GLU 


6 


-7.824 


2.247 


1.869 


ATOM 


92 


CA 


GLU 


6 


-8.984 


0.664 


2,592 


ATOM 


93 


HA 


GLU 


6 


-9.613 


0.554 


3.462 


ATOM 


94 


CB 


GLU 


6 


-9.838 


0,592 


1.327 


ATOM 


95 


HBl 


GLU 


6 


-9.174 


0.488 


0.478 



1 .00 

1.00 

1 ,00 

1 ,00 

1 ,00 

1 .00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1 .00 

1.00 



4.25 

4,60 

4.61 

4.65 

4.56 

5.17 

5.39 

5.60 

5.27 

5.22 

5.44 

5.42 

5.58 

3,37 

3.67 

3.48 

3.68 

3.77 

3.80 

4.60 

4.82 

4.71 

5.19 

5.95 

6.16 

5.27 

4.82 

6.46 

7.11 

6.10 

6.44 

3.44 

3.81 

2.90 

2.77 

2.75 

3.21 

2.53 

2.28 



4(j7T-')S()91()lS 



t\ 1 V^iVl 




A 1 WiVl 


/ 


A TOM 


/ o 


A T YAM 


/ y 


A 1 V_^iVl 


1 00 


\ TOM 


1 0 1 


ATOM 
A i vJiVl 


1 0^ 


A TOM 


1 0^ 


A TO\yT 


1 04 


A 1 UiVl 


1 OS 


A 1 LJIVI 


1 nfi 

i yjyj 


A TOIV/T 


1 07 


A TOM 


1 OR 


ATOM 


109 


ATOM 


110 


ATOM 


111 


ATOM 


112 



ATOM 113 

ATOM 1 14 

ATOM 115 

ATOM 116 

ATOM 117 

ATOM 1 1 8 



ATOM 


119 


ATOM 


120 


ATOM 


121 


ATOM 


122 


ATOM 


123 


ATOM 


124 


ATOM 


125 


ATOM 


126 


ATOM 


127 


ATOM 


128 


ATOM 


129 


ATOM 


130 


ATOM 


131 


ATOM 


132 


ATOM 


133 


ATOM 


134 




HB2 GLU 

CG GLi: 

HGl GLU 

HG2 GLU 

CD GLU 

OEl GLU 

OE2 GLli 

C GLU 

O GLU 

N SER 

HN SER 

CA SER 

HA SER 

CB SER 

HBl SER 

HB2 SER 

OG SER 

HG SER 

C SER 

O SER 

N CYS 

HN CYS 

CA CYS 

HA C^'S 



HBl 


C^'S 


HB2 


CYS 


C 


C^S 


O 


C^'S 


CB 


C^'S 


SG 


CYS 


N 


LEU 


HN 


LEU 


CA 


LEU 


HA 


LEU 


CB 


LEU 


HBl 


LEU 


HB2 


LEU 


CG 


LEU 


HG 


LEU 



6 -10.392 

6 -10.825 

6 -11.828 

6 -10.645 

6 -10.701 

6 -11.236 

6 -10.068 

6 -7.955 

6 -8.269 

7 -6.729 
7 -6.557 
7 -5.656 
7 -4.753 
7 -5.409 
7 -5.970 
7 -5.742 
7 -4.036 
7 -3.939 
7 -6.066 

7 -7.207 

8 -5.180 
8 -4.285 



8 


-5.558 


8 


-6.476 


8 


-3.606 


8 


-4.503 


8 


-5.763 


8 


-5.575 


8 


-4.568 


8 


-5.047 


9 


-6.155 


9 


-6.288 


9 


-6.400 


9 


-7.159 


9 


-5.122 


9 


-5.161 


9 


-4.275 


9 


-4.890 


9 


-4,327 



-60- 




1.513 1.227 

-0.566 1.307 

-0.167 1.356 

-1.192 2.168 

-1.416 0.055 

-1006 -0.997 

-2.491 0.127 

-0.456 2.557 

-1.602 2.876 

-0.115 2.130 

0.801 1.870 



-1.086 


2.013 


-0.563 


1 .693 


-1.725 


3.371 


-2.643 


3.442 


-1.040 


4.133 


-2.005 


3.572 


-2.883 


3.947 


-2.147 


0.984 


-2.143 


0.517 


-3.074 


0.643 


-3.074 


1.044 


-4.119 


-0.301 


-3.820 


-0.745 


-4.528 


-1.025 


-3.311 


-1.949 


-5.451 


0.432 


-6.522 


-0.140 


-4.261 


-1.425 


-5.510 


-2.659 


-5.353 


1.714 


-4.468 


2.0M 1 


-6.509 


2.578 


-7.116 


2.1 1 1 


-7.346 


2.756 


-7.820 


3.726 


-6.676 


2.739 


-8.438 


1.701 


-8.018 


0.881 



1 .00 


2.71 


1 .00 


2.93 


1 .00 


3.17 


1 .00 


3.31 


1.00 


3.19 


1,00 


3.57 


1.00 


3.55 


1.00 


2.53 


1.00 


2.98 


1.00 


2.02 


1.00 


1.87 


1.00 


1.98 


1.00 


1.74 


1.00 


2.69 


1.00 


3.00 



1.00 2.94 
1.00 3.30 
1.00 3.67 



1.00 


2.01 


1.00 


2.80 


1.00 


1.78 


1.00 


1.85 


1.00 


2.24 


1.00 


2.23 


1.00 


2.70 


1.00 


2.24 


1.00 


2.87 


1.00 


3.43 


1.00 


2.52 


1.00 


3.51 


1.00 


3.11 


1.00 


3.01 


1 .00 


3.82 


1.00 


4.03 


1.00 


4.45 


1.00 


4.87 


1.00 


4.44 


1.00 


4.92 


1.00 


4.96 



407 1 ! (!L S 


w 










• 


ATOM 


135 


CDl 


LEU 


9 


-4.073 


-9.583 


2.28 1 


ATOM 


136 


HDl 1 


LEU 


9 


-3.633 


-9.273 


3.217 


ATOM 


137 


HD12 


LEU 


9 


-3.291 


-9.852 


1 .587 


ATOM 


138 


HDl 3 


LEU 


9 


-4.715 


-10.435 


2.449 


ATOM 


1 39 


CD2 


LEU' 


9 


-6.214 


-8.95! 


1 . 1 49 


ATOM 


140 


11021 


LEU 


9 


-6.026 


-9 607 


0.3 13 


ATOM 


141 


HD22 


LEU 


9 


-6.812 


-8.1 1 1 


0.82 1 


ATOM 


142 


HD23 


LEU 


9 


-6.743 


-9.490 


1.921 


ATOM 


143 


C 


LEU 


9 


-6.919 


-6.051 


3.948 


ATOM 


144 


O 


LEU 


9 


-7.894 


-6.604 


4.462 


ATOM 


145 


N 


GLY 


10 


-6.267 


-5.033 


4.530 


ATOM 


146 


HN 


GLY 


10 


-5.501 


-4.627 


4.073 


ATOM 


1 AH 

1 ~t i 


A 


GLY 


10 


-6.681 


-4.513 


5.824 


ATOM 


148 


HAl 


GLY 


10 


-5.848 


-4.575 


6.50 / 


ATOM 


149 


HA2 


GLY 


10 


-7.490 


-5.120 


6.203 


ATOM 


150 


C 


GLY 


10 


-7.146 


-3.066 


5.745 


ATOM 


151 


O 


GLY 


10 


-8.151 


-2.771 


5.104 


ATOM 


152 


N 


GLN 


11 


-6.406 


-2.169 


6.402 


ATOM 


153 


HN 


GLN 


11 


-5.613 


-2.477 


6.880 


ATOM 


154 


CA 


GLN 


11 


-6.727 


-0.736 


6.40 / 


ATOM 


155 


HA 


GLN 


1 1 


-6.984 


-0.452 


5.397 


ATOM 


156 


CB 


GLN 


11 


-7.923 


-0.451 


7.325 


ATOM 


157 


HBl 


GLN 


11 


-7.690 


0.399 


7.949 


ATOM 


158 


HB2 


GLN 


11 


-8.094 


-1.312 


7.954 


ATOM 


159 


CG 


GLN 


11 


-9.210 


-0.148 


D.^ /U 


ATOM 


160 


HGl 


GLN 


11 


-9.043 


-0.320 


5.517 


ATOM 


161 


HG2 


GLN 


11 


-9.467 


0.889 


b. 1 1 1 


ATOM 


162 


CD 


GLN 


11 


-10.376 


-1.008 


/ X)Id 


ATOM 


163 


OEl 


GLN 


11 


-10.540 


-2.140 


6.5DO 


ATOM 


164 


NE2 


GLN 


11 


-11.195 


-0.474 




A'lOM 


165 


I1E2I 


GLN 


1 1 


-1 1.006 


0.433 


8.-44 


ATOM 


166 


HE22 


GLN 


1 1 


-1 1.958 


-1.009 


0 TOO 

8.2-V 


ATOM 


167 


C 


GLN 


1 1 


-5.510 


0.082 


0 . 0 5 i 


ATOM 


168 


O 


GLN 


1 1 


-5.279 


0.248 


8.U?t- 


ATOM 


169 


N 


GLN 


12 


-4.718 


0.570 




ATOM 


170 


HN 


GLN 


12 


-4.930 


0.397 


4.9j5 


ATOM 


171 


CA 


GLN 


12 


-3.520 


1.338 


0 , Z 1 V 


ATOM 


172 


HA 


GLN 


12 


-3.399 


1 .297 


7 29- 


ATOM 


173 


CB 


GLN 


12 


-2.294 


0.0 


J.J / J 



1 .00 

1 .00 

1 .00 

1 .00 

1 .00 

1 .00 

1 .00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1 .00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 



5.65 

5.93 

5.76 

6.14 

5.23 

5.47 

5.42 

5.45 

3.94 

4.53 

3.63 

3.32 

4.02 

4.32 

4.54 

3.77 

4.10 

3.61 

3.76 

3.67 

3.70 

4.51 

4.89 

4.88 

4.88 

5.09 

4.85 

5.60 

6.07 

6.05 

5.94 

6.67 

3.45 

4.02 

2.94 

2.77 

3.05 

3.57 

3.41 



-61- 



4()7T-9S09!()LS 

ATOM 1 74 

ATOM 1 75 

ATOM 1 76 

ATOM 177 

ATOM 178 

ATOM 1 79 

ATOM 180 

ATOM 181 

ATOM 182 

ATOM 183 

ATOM 184 

ATOM 185 

ATOM 186 



A i UM 


1 5^7 
1 o / 


A 1 kJNi 


i o o 


A 1 UiVi 




A i iJM 


1 on 


A 1 UM 


1 O 1 


A 1 UM 


1 Q9 
1 


A 1 UM 




A 1 UM 


1 


A 1 UM 


1 

1 V J 


ATOM 


196 


ATOM 


197 


ATOM 


198 


ATOM 


199 


ATOM 


200 


ATOM 


201 


ATOM 


202 


ATOM 


203 


ATOM 


204 


ATOM 


205 


ATOM 


206 


ATOM 


207 


ATOM 


208 


ATOM 


209 


ATOM 


210 


ATOM 


211 


ATOM 


212 











HBl 


GLN 


12 


-1.434 


HB2 


GLN 


12 


-2.471 


CO 


C}LN 


12 


-1.976 


HGl 


GLN 


12 


-1.131 


HG2 


GLN 


12 


-2.834 


CD 


GLN 


12 


-1.639 


OEl 


GLN 


12 


-0.584 


NE2 


GLN 


12 


-2.538 


HE21 


GLN 


12 


-3.359 


HE22 


GLN 


12 


-2.342 


C 


GLN 


12 


-3.640 


o 


GLN 


12 


-4.438 


N 


VAT. 


13 


-2.821 


HN 


VAL 


13 


-2.199 


CA 


VAL 


13 


-2.795 


HA 


VAL 


13 


-3.226 


CB 


VAL 


13 


-1.331 



no 


VAL 


13 


-1.291 


L-vj i 


V AT 


1 3 


-0.380 


HGll 


VAL 


13 


-0.395 


HG12 


VAL 


13 


-0.691 


HGl 3 


VAL 


13 


0.622 


CG2 


VAL 


13 


-0.878 


HG21 


VAL 


13 


-0.918 


HG22 


VAL 


13 


-1.528 


HG23 


VAL 


13 


0.135 


C 


VAL 


13 


-3.615 


O 


VAL 


13 


-4.211 


N 


PRO 


14 


-3.671 


CA 


PRO 


14 


-4.438 


HA 


PRO 


14 


■5.448 


CB 


PRO 


14 


-4.454 


HBl 


PRO 


14 


-5.342 


HB2 


PRO 


14 


-4.437 


CG 


PRO 


14 


-3.224 


HGl 


PRO 


14 


-3.368 


HG2 


PRO 


14 


-2.384 


CD 


PRO 


14 


-3.010 


HDl 


PRO 


14 


-3.474 



-62- 




1.314 


5.730 


1.00 


3.57 


).583 


4.512 


1.00 


3.50 


-0.700 


6. 1 30 


1 .00 


4.12 


-1.105 


5.591 


1 .00 


4.31 


- 1 .340 


5.984 


1.00 


4.40 


-0.67 1 


7.609 


1.00 


4.78 


-0.178 


8.007 


1.00 


5.27 


-1.199 


8.436 


1.00 


5.19 


-1.574 


8.052 


1.00 


5.11 


-1.191 


9.396 


1.00 


5.76 


2.820 


5.813 


LOO 


2.91 


3.561 


6.388 


1.00 


3.44 


3.248 


4.843 


1.00 


2.75 


2.625 


4.444 


1.00 


2.86 


4.629 


4.377 


1.00 


3.05 


5.246 


5.153 


1.00 


3.57 


5.091 


4,154 


1.00 


3.46 


6.137 


4.363 


1.00 


3.85 


4.402 


5.127 


1.00 


3.80 


3.337 


4.956 


1.00 


3.78 


4.607 


6.140 


1.00 


4.15 


4.776 


4.976 


1.00 


4.28 


4.877 


2.709 


1.00 


3.89 


5.815 


2.176 


1.00 


4.21 


4.163 


2.224 


1.00 


4.18 


4.503 


2.700 


1.00 


4.14 


4.813 


3.090 


1.00 


3.04 


3.857 


2.589 


1.00 


2.95 


6.053 


2.539 


1.00 


3.66 


6.341 


1.318 


1.00 


4.09 


5.975 


1.401 


1.00 


4.39 


7.870 


1.263 


1.00 


5.04 


8.242 


1.750 


1.00 


5.62 


8.197 


0.234 


1 .00 


5.18 


8.289 


1.986 


1.00 


5.32 


9.268 


2.418 


1.00 


5.91 


8.296 


1.308 


1.00 


5.77 


7.267 


3.068 


1.00 


4.46 


7.590 


3.989 


1.00 


4.68 



4()7T-')S{W!()L'S 

ATOM 213 

ATOM 214 

ATOM 215 

ATOM 216 

ATOM 217 

ATOM 218 

ATOM 219 

ATOM 220 

ATOM 221 

ATOM 222 

ATOM 223 

ATOM 224 



A i UM 




A lUM 




A lOM 


JL— 1 


ATOM 


228 


ATOM 


229 


ATOM 


230 


ATOM 


231 


ATOM 


232 


ATOM 


233 


ATOM 


234 


ATOM 


235 


ATOM 


236 


ATOM 


237 


ATOM 


238 


ATOM 


239 


ATOM 


240 


ATOM 


241 


ATOM 


242 


AlOM 


243 



ATOM 244 

ATOM 245 

ATOM 246 

ATOM 247 

ATOM 248 

ATOM 249 

ATOM 250 

ATOM 25 1 



w 




HD2 


PR(J 


C 


PRO 


O 


PRO 


N 


CYS 


HN 


C>S 


CA 


C\S 


HA 


C\S 


HBl 


C^S 


HB2 


CYS 


C 


CYS 


O 


CYS 


CB 


CYS 


SG 


CYS 


N 


CYS 


HN 


CYS 


CA 


CYS 


HA 


C\S 


HBl 


CYS 



HB2 CYS 

C CYS 

O CYS 

CB CYS 

SG CYS 

N ASP 

HN ASP 



CA 


ASP 


HA 


ASP 


CB 


ASP 


HBl 


ASP 


HB2 


ASP 


CG 


ASP 


ODl 


ASP 


OD2 


ASP 


C 


ASP 


O 


ASP 


N 


PRO 


CA 


PRO 


HA 


PRO 


CB 


PRO 



14 -1.956 

14 -3.795 

14 -2.821 

15 -4. .341 
15 -5.104 
15 3.814 
15 -2.780 
15 -4.871 
15 -3.464 
15 -4.533 
15 -4.584 
15 -3.851 
15 -2.856 



16 


-5.057 


16 


-4.985 


16 


-5.710 


16 


-5.636 


16 


-7.294 


16 


-7.597 


16 


-4.979 


16 


-5.610 


16 


-7.196 




ft 9 1 ^ 
-o . — i u 


17 


-3.630 


17 


-3.198 


17 


-2.777 


17 


-2.901 


17 


-3.219 


17 


-3.503 


17 


-4.072 


17 


-2.132 


17 


-1.748 


17 


-1.668 


17 


-1.283 


17 


-0.465 


18 


-0.908 


18 


0.485 


18 


1.165 


18 


0.438 




7.102 3.216 

5.765 0.053 

5.013 0.129 

6.138 -iT14 

6.751 -1.110 

5.676 -2 394 

5.986 -2.427 

3.825 -2.642 

3.720 -1.587 

6.333 -3.587 

7.561 -3.661 

4.149 -2.500 



3.502 


-3.887 


5.523 


-4.536 


4.556 


-4.441 


6.043 


-5.736 


5.275 


-6.496 


7.293 


-4.999 


5.564 


-4.810 


7.301 


-6.249 


8.273 


-6.672 


6.327 


-5.471 


6.341 


-6.984 


7 "'59 


-6.197 


6.451 


-5.847 


8.374 


-6.639 


8.469 


-7.708 


9.692 


-5.975 


9.496 


-4.953 


10.083 


-6.508 


10.754 


-5.977 


1 1.210 


-7.075 


11.129 


-4.880 


8.093 


-6.351 


8.102 


-7.271 


7.839 


-5.070 


7.553 


-4.669 


8.301 


-5.023 


7.600 


-3.127 



1.00 


4.53 


1.00 


3.77 


1 .00 


4.07 


1 .00 


3.60 


1.00 


3.75 


1.00 


3.70 


1.00 


3.90 


1.00 


4.62 


1.00 


4.34 


1.00 


3.85 


1.00 


4.33 


1.00 


4.05 


1.00 


4.02 


1.00 


3.70 


1.00 


3.50 


1.00 


4.08 


1.00 


3.93 



1.00 4.46 

1.00 4.09 

1.00 4.59 

1.00 5.05 

1.00 4.36 

1.00 5.13 



1.00 


4.60 


1.00 


4.30 


1.00 


5.13 


1.00 


5.59 


1.00 


5.62 


1.00 


5.76 


1.00 


5.74 


1.00 


6.17 


1.00 


6.53 


1.00 


6.56 


1.00 


4.80 


1.00 


4.56 


1.00 


5.13 


1.00 


5.06 


1.00 


5.33 


1.00 


5.85 
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ATOM 


252 


HBl 


PRO 


18 


1.243 


8.221 


-2.761 


1 .00 


6.32 


ATOM 


253 


HB2 


PRO 


18 


0.542 


6.60 1 


-2.729 


1.00 


5.79 


ATOM 


254 


CG 


PRO 


18 


-0.891 


8.182 


-2.786 


1 .00 


6.39 


ATOM 


255 


HGl 


PRO 


18 


-0.814 


9.257 


-2.712 


1 .00 


6.94 


ATOM 


256 


HG2 


PRO 


18 


-1.249 


7.765 


-1.857 


1 .00 


6.75 


ATOM 


257 


CD 


PRO 


18 


-1.800 


7 803 


-3.912 


1 .00 


5.92 


ATOM 


258 


HDl 


PRO 


18 


-2.598 


8.519 


-4.012 


1.00 


6.35 


ATOM 


259 


HD2 


PRO 


18 


-2.194 


6.810 


-3.762 


1.00 


5.99 


ATOM 


260 


C 


PRO 


18 


0.965 


6.192 


-5.162 


1.00 


4.35 


ATOM 


261 


O 


PRO 


18 


0.484 


5.690 


-6.181 


1.00 


4.45 


ATOM 


262 


N 


CYS 


19 


1.908 


5.589 


-4.438 


1.00 


4.00 


ATOM 


263 


HN 


CYS 


19 


2.257 


6.027 


-3.634 


1.00 


4.34 


ATOM 


264 


CA 


CYS 


19 


2.427 


4.285 


-4.822 


1.00 


3.50 


ATOM 


265 


HA 


CYS 


19 


2.416 


4.237 


-5.903 


1.00 


3.76 


ATOM 


266 


HBl 


CYS 


19 


4.354 


5.069 


-4.280 


1.00 


4.02 


ATOM 


267 


HB2 


CYS 


19 


4.412 


3.492 


-5.049 


1.00 


4.20 


ATOM 


268 


C 


CYS 


19 


1.501 


3.201 


-4.285 


1.00 


2.98 


ATOM 


269 


O 


CYS 


19 


0.949 


2.444 


-5.067 


1.00 


2.87 


ATOM 


270 


CB 


C^'S 


19 


3.880 


4.100 


-4.333 


1.00 


3.68 


ATOM 


271 


SO 


CYS 


19 


4.070 


3.305 


-2.701 


1.00 


3.27 


ATOM 


272 


N 


ALA 


20 


1.301 


3.192 


-2.958 


1.00 


3.00 


ATOM 


273 


HN 


ALA 


20 


1.746 


3.891 


-2.427 


1.00 


3.43 


ATOM 


274 


CA 


ALA 


20 


0.381 


2.259 


-2.261 


1.00 


2.68 


ATOM 


275 


HA 


ALA 


20 


0.867 


1.965 


-1.352 


1.00 


2.43 


ATOM 


276 


CB 


ALA 


20 


-0.896 


2.998 


-1.882 


1.00 


2.88 


ATOM 


277 


HBl 


ALA 


20 


-1.747 


2.351 


-2.033 


1.00 


3.03 


ATOM 


278 


HB2 


ALA 


20 


-0.997 


3.878 


-2.499 


1.00 


3.21 


ATOM 


279 


HB3 


ALA 


20 


-0.847 


3.293 


-0,841 


1.00 


3.09 


ATOM 


280 


C 


ALA 


20 


0.077 


0.972 


-3.049 


1.00 


2,88 


ATOM 


281 


O 


ALA 


20 


-0.665 


1.040 


-3.996 


1.00 


3.63 


ATOM 


282 


N 


THR 


21 


0.639 


-0.179 


-2.606 


1.00 


2,52 


ATOM 


283 


HN 


THR 


21 


1.178 


-0.127 


-1.824 


1 .00 


2.17 


ATOM 


284 


CA 


THR 


21 


0.492 


-1.519 


-3.247 


1.00 


3.01 


ATOM 


285 


HA 


THR 


21 


0.640 


-1.41 1 


-4.288 


1 .00 


3.44 


ATOM 


286 


CB 


THR 


21 


1.615 


-2.430 


-2.712 


1 .00 


3.51 


ATOM 


287 


HB 


THR 


21 


2.292 


-1.827 


-2,145 


1.00 


3.87 


ATOM 


288 


OGl 


THR 


21 


2.342 


-3.018 


-3,765 


1.00 


3.92 


ATOM 


289 


HGl 


THR 


21 


1.829 


-3.73 1 


-4,147 


1.00 


4.04 


ATOM 


290 


CG2 


THR 


21 


1.139 


-3.534 


-1,814 


1 .00 


3.64 
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ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 



291 

292 

293 

294 

295 

296 

297 

298 

299 

300 

301 

302 

303 

304 

305 

306 

307 

308 

309 

310 

311 

312 

313 

314 

315 

316 

317 

318 

319 

320 

321 

322 

323 

324 

325 

326 

327 

328 

329 



HG21 

HG22 

HG23 

C 

O 

N 

HN 

CA 

HA 

HBl 

HB2 

C 

O 

CB 

SG 

N 

HN 

CA 

HA 

CB 

HBl 

HB2 

CG 

CDl 

HDl 

CD2 

HD2 

CEl 

Iffil 

CE2 

HE2 

CZ 

OH 

HH 

C 

O 

N 

HN 

CA 



THR 
THR 
IHR 
THR 
THR 
CYS 
CYS 
CYS 
CYS 
CYS 
CYS 
CYS 
CYS 
C^S 
CYS 
T^'R 
T^'R 
T^'R 
T^'R 
TYR 
TYR 
T^R 
T^'R 
T^'R 
T^'R 
T^'R 
TYR 
T^'R 
T^'R 
T\'R 
T1 R 
T^ R 
T^'R 
T^'R 
TYR 
TYR 
CYS 
CYS 
CYS 



21 

21 

21 

21 

21 

22 

22 

22 

22 

~>i 

2^ 

22 

22 

22 

23 

23 

23 

23 

23 

23 

23 

23 

23 

23 

23 

23 

23 

23 

23 

23 

23 

23 

23 

23 

23 

24 

24 

24 



1.148 

0.138 

1.776 

-0.893 

-1.823 

- 1 .047 

-0.284 

-2.361 

-2.941 

-2.576 

-3.066 

-2.268 

-2.573 

-3.092 

-4.841 

-1.888 

-1.689 

-1.799 

-2.770 

-1.483 

-0.990 

-2.406 

-0.599 

-1.141 

-2.212 

0.776 

1.205 

-0.332 

-0.771 

1 .595 

2.667 

1.036 

1.846 

2.160 

-0.764 

-0.942 

0.305 

0.376 

1 .379 



-4.469 

-3.302 

-3.606 

-2.141 

-1.429 

-3.453 

-3.987 

-4.091 

-3.546 

-4.621 

-2.975 

-5.563 

-6.463 

-3.999 

-4.522 

-5.785 

-5.024 

-7.133 

-7.368 

-8.137 

-7.599 

-8.540 

-9.294 

-10.531 

-10.660 

-9.147 

-8.183 

-11.592 

-12.547 

-10.200 

-10.065 

-11.421 

-12.475 

-12.917 

-7.219 

-7.990 

-6.427 

-5.836 

-6.392 



-2.352 

-1.513 

-0.94() 

-3.040 

-2.757 

-3.225 

-3.488 

-3.092 

-2.350 

-5.160 

-4.783 

-2.656 

-3.444 

-4.442 

-4.418 

-1.377 

-0.821 

-0.783 

-0.375 

-1.875 

-2.663 

-2.250 

- 1 .468 

-1.148 

-1.183 

-1.418 

-1.658 

-0.790 

-0.545 

- 1 .059 

-1.025 

-0.748 

-0.395 

-1.187 

0.355 

1.300 

0.252 

-0.520 

1 .250 



1 .00 

1 .00 

1 .00 

1 .00 

1 .00 

1 .00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1 .00 

1.00 

1.00 



3.73 

3.73 

3.90 

2.82 

2.79 

3.23 

3.75 

3.29 

2.73 

4.32 

4.25 

3.83 

4.63 

3.86 

3.79 

3.64 

3.23 

4.31 

4.75 

4.99 

4.83 

5.71 

5.19 

5.34 

5.46 

5.53 

5.77 

5.56 

5.82 

5.81 

6.26 

5.71 

6,01 

6.05 

4.08 

4.31 

3.99 

4.20 

^ Q7 
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ArOM 


330 


HA 


CYS 


24 


2.008 


-5.546 


0.990 


1 .00 


3.54 


ATOM 


331 


HBl 


CYS 


24 


0.849 


-7.090 


3.203 


1 .00 


4.45 


ATOM 


332 


HB2 


CYS 


24 


-0.159 


-5.779 


2.608 


1 .00 


4.72 


ATOM 


333 


C 


CYS 


24 


2.230 


-7.664 


1.271 


1 .00 


4.56 


ATOM 


334 


O 


CYS 


24 


1 .702 


-8.775 


1 .300 


1 .00 


4.95 


ATOM 


335 


CB 


CYS 


24 


0.841 


-6.167 


2.655 


1.00 


4.21 


ATOM 


336 


SG 


CYS 


24 


1.818 


-4.99 1 


3.578 


1.00 


3.97 


ATOM 


337 


N 


ARG 


25 


3.553 


-7.485 


1.306 


1.00 


4.81 


ATOM 


338 


HN 


ARG 


25 


3.910 


-6.572 


1.315 


1.00 


4.70 


ATOM 


339 


CA 


ARG 


25 


4.485 


-8.614 


1.370 


1.00 


5.41 


ATOM 


340 


HA 


ARG 


25 


4.148 


-9.364 


0.676 


1.00 


5.78 


ATOM 


341 


CB 


ARG 


25 


5.903 


-8.174 


0.986 


1.00 


5.35 


ATOM 


342 


HBl 


ARG 


25 


6.586 


-8.982 


1 .206 


1.00 


5.48 


ATOM 


343 


HB2 


ARG 


25 


6.173 


-7.317 


1.584 


1.00 


5.55 


ATOM 


344 


CG 


ARG 


25 


6.065 


-7.802 


-0.478 


1.00 


5.19 


ATOM 


345 


HGl 


ARG 


25 


5.093 


-7.787 


-0.945 


1.00 


5.37 


ATOM 


346 


HG2 


ARG 


25 


6.687 


-8.541 


-0.961 


1.00 


5.36 


ATOM 


347 


CD 


ARG 


25 


6.712 


-6.433 


-0.636 


1.00 


5.12 


ATOM 


348 


HDl 


ARG 


25 


6.606 


-5.890 


0.292 


1.00 


5.22 


ATOM 


349 


HD2 


ARG 


25 


6.201 


-5.899 


-1.423 


1.00 


4.97 


ATOM 


350 


NE 


ARG 


25 


8.134 


-6.527 


-0.973 


1.00 


5.60 


ATOM 


351 


HE 


ARG 


25 


8.367 


-6.596 


-1.923 


1.00 


5.89 


ATOM 


352 


CZ 


ARG 


25 


9.125 


-6.525 


-0.074 


1.00 


5.97 


ATOM 


353 


NHl 


ARG 


25 


8.863 


-6.428 


1.227 


1.00 


5.99 


ATOM 


354 


HHll 


ARG 


25 


7.920 


-6.356 


1.546 


1.00 


5.75 


ATOM 


355 


HH12 


ARG 


25 


9.614 


-6.428 


1.888 


1.00 


6.45 


ATOM 


356 


NH2 


ARG 


25 


10.386 


-6.622 


-0.477 


1.00 


6.62 


ATOM 


357 


HH21 


ARG 


25 


10.595 


-6.697 


-1.451 


1.00 


6.89 


ATOM 


358 


IIH22 


ARG 


25 


1 1 , 1 27 


-6.620 


0.194 


1.00 


6.99 


ATOM 


359 


C 


ARG 


25 


4.514 


-9.217 


2.773 


1.00 


5.75 


ATOM 


360 


O 


ARG 


25 


4.289 


-10.415 


2.952 


1.00 


6.36 


ATOM 


361 


N 


PHE 


26 


4.802 


-8.369 


3.759 


1.00 


5.44 


ATOM 


362 


HN 


PHE 


26 


4.967 


-7.433 


3.534 


1 .00 


4.97 


ATOM 


363 


CA 


PHE 


26 


4.879 


-8.793 


5.158 


1.00 


5.87 


ATOM 


364 


HA 


PHE 


26 


4.990 


-9.862 


5.151 


1.00 


6.37 


ATOM 


365 


CB 


PHE 


26 


6.121 


-8.187 


5.838 


1.00 


5.87 


ATOM 


366 


HBl 


PHE 


26 


6.993 


-8.742 


5.519 


1.00 


6.14 


ATOM 


367 


HB2 


PHE 


26 


6.026 


-8.279 


6.907 


1.00 


5.89 


ATOM 


368 


CG 


PHE 


26 


6.361 


-6.740 


5.515 


1.00 


5.68 
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ATOM 


369 


CD! 


PHE 


26 


6.033 


-5.748 


6.420 


1.00 


5.91 


ATOM 


370 


HDl 


PHE 


26 


5.595 


-6.022 


7.368 


1.00 


6.30 


ATOM 


371 


CD2 


PHE 


26 


6.920 


-6.38 1 


4.303 


1 .00 


5.59 


ATOM 


372 


HD2 


PHE 


26 


7.172 


-7.156 


3.593 


1 .00 


5.76 


ATOM 


373 


CEl 


PHE 


26 


6.259 


-4.419 


6.122 


1.00 


5.89 


ATOM 


374 


HEl 


PHE 


26 


5 999 


-3.653 


6.837 


1.00 


6.28 


ATOM 


375 


CE2 


PHE 


26 


7.150 


-5.054 


3.995 


1.00 


5.57 


ATOM 


376 


HE2 


PHE 


26 


7.587 


-4.787 


3.044 


1.00 


5.74 


ATOM 


377 


CZ 


PHE 


26 


6.819 


-4.07 1 


4.907 


1.00 


5.65 


ATOM 


378 


HZ 


PHE 


26 


6.997 


-3.032 


4.670 


1.00 


5.75 


ATOM 


379 


C 


PHE 


26 


3.576 


-8.452 


5.906 


1.00 


5.78 


ATOM 


380 


O 


PHE 


26 


2.491 


-8.694 


5.373 


1.00 


5.69 


ATOM 


381 


N 


PHE 


27 


3.669 


-7.902 


7.131 


1.00 


5.97 


ATOM 


382 


HN 


PHE 


27 


4.549 


-7.733 


7.517 


1.00 


6.18 


ATOM 


383 


CA 


PHE 


27 


2.477 


-7.552 


7.910 


1.00 


6.04 


ATOM 


384 


HA 


PHE 


27 


1.921 


-8.463 


8.076 


1.00 


6.43 


ATOM 


385 


CB 


PHE 


27 


2.876 


-6.962 


9.273 


1. 00 


6.49 


ATOM 


386 


HBl 


PHE 


27 
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Table 5. MARP atomic coordinates in 3 dimensional space determine by NMR at 800 



mHz. 
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0.00 


C 


A 


26 


2.227 


10.315 


1.814 


1.00 


0.00 


o 


A 


26 


2.964 


1 1 .769 


-1.380 


1.00 


0.00 


c 


A 


26 


4.046 


12.730 


-1.783 


1.00 


0.00 


c 


A 


26 


4.284 


13.002 


-3.120 


1.00 


0.00 


c 


A 


26 


4.824 


13.363 


-0.826 


1.00 


0.00 


c 


A 


26 


5.279 


13.884 


-3.496 


1.00 


0.00 


c 


A 


26 


5.820 


14.246 


-1.195 


1.00 


0.00 


c 


A 


26 


6.047 


14.508 


-2.532 


1 .00 


0.00 


c 


A 


26 




9.077 


-0.401 


1.00 


0.00 


H 
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MOM 


371 


HA 


PHE 


A 


26 


4.427 


10.938 


-0.049 


1 .00 


0.00 


H 


ATOM 


372 


IHB 


PHE 


A 


26 


2. 711 


11.173 


-2.243 


1 .00 


0.00 


H 


ATOM 


373 


2HB 


PHE 


A 


26 


2.099 


12.346 


- 1 .083 


1 .00 


0.00 


H 


ATOM 


374 


HDl 


PHE 


A 


26 


3.684 


12.516 


-3.875 


1 .00 


0.00 


H 


ATOM 


375 


HD2 


PHE 


A 


26 


4.647 


13.158 


0.221 


1 .00 


0.00 


H 


ArOM 


376 


HFI 


PHE 


A 


26 


5.457 


14.087 


-4.542 


1 .00 


0.00 


H 


ATOM 


377 


HE2 


PHE 


A 


26 


6.419 


14.732 


-0.439 


1.00 


0.00 


H 


ATOM 


378 


HZ 


PHE 


A 


26 


6.825 


15.198 


-2.822 


1.00 


0.00 


H 


ATOM 


379 


N 


PHE 


A 


27 


2.389 


12.484 


1.258 


1.00 


0.00 


N 


ATOM 


380 


CA 


PHE 


A 


27 


1 .674 


12.953 


2.425 


1.00 


0.00 


c 


ATOM 


381 


C 


PHE 


A 


27 


0.292 


12.313 


2.499 


1.00 


0.00 


c 


ATOM 


382 


O 


PHE 


A 


27 


0.004 


11.530 


3.404 


1.00 


0.00 


o 


ATOM 


383 


CB 


PHE 


A 


27 


1 .552 


14.479 


2.388 


1.00 


0.00 


c 


ATOM 


384 


CG 


PHE 


A 


27 


2.188 


15.144 


1.197 


1.00 


0.00 


c 


ATOM 


385 


CDl 


PHE 


A 


27 


3.479 


15.642 


1.273 


1.00 


0.00 


c 


ATOM 


386 


CD2 


PHE 


A 


27 


1.495 


15.270 


0.004 


1.00 


0.00 


c 


ATOM 


387 


CEl 


PHE 


A 


27 


4.067 


16.253 


0.182 


1.00 


0.00 


c 


ATOM 


388 


CE2 


PHE 


A 


27 


2.077 


15.878 


-1.092 


1.00 


0.00 


c 


ATOM 


389 


CZ 


PHE 


A 


27 


3.364 


16.372 


-1.002 


1.00 


0.00 


c 


ATOM 


390 


H 


PHE 


A 


27 


2.704 


13.137 


0.616 


1.00 


0.00 


H 


ATOM 


391 


HA 


PHE 


A 


27 


2.240 


12.666 


3.299 


1.00 


0.00 


H 


ATOM 


392 


IHB 


PHE 


A 


27 


0.513 


14.742 


2.382 


1.00 


0.00 


H 


ATOM 


393 


2HB 


PHE 


A 


27 


2.015 


14.880 


3.269 


1.00 


0.00 


H 


ATOM 


394 


HDl 


PHE 


A 


27 


4.030 


15.550 


2.199 


1.00 


0.00 


H 


ATOM 


395 


HD2 


PHE 


A 


27 


0.489 


14.884 


-0.068 


1.00 


0.00 


H 


ATOM 


396 


HEl 


PHE 


A 


27 


5.073 


16.637 


0.254 


1.00 


0.00 


H 


ATOM 


397 


HE2 


PHE 


A 


27 


1.527 


15.970 


-2.015 


1.00 


0.00 


H 


ATOM 


398 


HZ 


PHE 


A 


27 


3.822 


16.848 


-1.857 


1.00 


0.00 


H 


ATOM 


399 


N 


ASN 


A 


28 


-0.555 


12.648 


1.532 


1.00 


0.00 


N 


ATOM 


400 


CA 


ASN 


A 


28 


-1.907 


12.105 


1.474 


1.00 


0.00 


c 


ATOM 


401 


C 


ASN 


A 


28 


-2.437 


12.127 


0.043 


1.00 


0.00 


c 


ATOM 


402 


O 


ASN 


A 


28 


-3.223 


13.002 


-0.322 


1.00 


0.00 


o 


ATOM 


403 


CB 


ASN 


A 


28 


-2.840 


12.901 


2.391 


1.00 


0.00 


c 


ATOM 


404 


CG 


ASN 


A 


28 


-2.653 


14.399 


2.249 


1.00 


0.00 


c 


ATOM 


405 


ODl 


ASN 


A 


28 


-1.863 


15.009 


2.970 


1.00 


0.00 


o 


ATOM 


406 


ND2 


ASN 


A 


28 


-3.381 


15.001 


1.317 


1.00 


0.00 


N 


ATOM 


407 


H 


ASN 


A 


28 


-0.262 


13.273 


0.836 


1.00 


0.00 


H 


ATOM 


408 


HA 


ASN 


A 


28 


-1.868 


11.081 


1.815 


1.00 


0.00 


H 
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ATOM 


409 


IHB 


ASN 


ATOM 


410 


2HB 


ASN 


ATOM 


411 


1HD2 


ASN 


ATOM 


412 


2HD2 


ASN 


ATOM 


413 


N 


ALA 


ATOM 


414 


CA 


ALA 


ATOM 


415 


C 


ALA 


ATOM 


416 


O 


ALA 


ATOM 


417 


CB 


ALA 


ATOM 


418 


H 


ALA 


ATOM 


419 


HA 


ALA 


ATOM 


420 


IHB 


ALA 


ATOM 


421 


2HB 


ALA 


ATOM 


422 


3HB 


ALA 


ATOM 


423 


N 


PHE 


ATOM 


424 


CA 


PHE 


ATOM 


425 


C 


PHE 


ATOM 


426 


O 


PHE 


ATOM 


427 


CB 


PHE 


ATOM 


428 


CG 


PHE 


ATOM 


429 


CDl 


PHE 


ATOM 


430 


CD2 


PHE 


ATOM 


431 


CEl 


PHE 


ATOM 


432 


CE2 


PHE 


ATOM 


433 


CZ 


PHE 


ATOM 


434 


H 


PHE 


ATOM 


435 


HA 


PHE 


ATOM 


436 


IHB 


PHE 


ATOM 


437 


2HB 


PHE 


ATOM 


438 


HDl 


PHE 


ATOM 


439 


HD2 


PHE 


ATOM 


440 


HEI 


PHE 


ATOM 


441 


HE2 


PHE 


ATOM 


442 


HZ 


PHE 


ATOM 


443 


N 


CYS 


ATOM 


444 


CA 


CYS 


ATOM 


445 


C 


CYS 


ATOM 


446 


O 


CYS 



A 28 -3.864 12.661 

A 28 -2.646 12.626 

A 28 -3.990 14.452 

A 28 -3.280 15.969 

A 29 -2.002 11.166 

A 29 -2.439 11.092 

A 29 -2.410 9.657 

A 29 -3.456 9.057 

A 29 -1.571 11.985 

A 29 -1.370 10.496 

A 29 -3.453 11.460 

A 29 -1.817 11.822 

A 29 -0.530 1 1 .747 

A 29 -1.748 13.020 

A 30 -1.210 9.116 

A 30 -1.053 7.753 

A 30 -L161 6.744 

A 30 -0.212 6.017 

A 30 0.293 7.598 

A 30 0.239 7.930 

A 30 -0.238 9.157 

A 30 0.667 7.012 

A 30 -0.288 9.465 

A 30 0.621 7.315 

A 30 0.142 8.542 

A 30 -0.411 9.645 

A 30 -1.848 7.564 

A 30 1.015 8.254 

A 30 0.627 6.575 

A 30 -0.575 9.880 

A 30 1.041 6.052 

A 30 -0.662 10.426 

A 30 0.956 6.591 

A 30 0.105 8.780 

A 31 -2.327 6.707 

A 31 -2.563 5.790 

A 31 -3.314 4.548 

A 31 -4.529 4,582 




2.146 


1 .00 


0.00 


H 


3.417 


i .00 


0.00 


H 


0.770 


1 .00 


0.00 


H 


1.203 


1 .00 


0.00 


H 


-0.766 


1 .00 


0.00 


N 


-2.155 


1 OO 


0.00 


C 


-2.673 


1.00 


0.00 


c 


-2.917 


1.00 


0.00 


o 


-3.028 


1.00 


0.00 


c 


-0.422 


1.00 


0.00 


H 


-2.205 


1.00 


0.00 


H 


-4.068 


1.00 


0.00 


H 


-2.865 


1.00 


0.00 


H 


-2.773 


1.00 


0.00 


H 


-2.842 


1.00 


0.00 


N 


-3.336 


1.00 


0.00 


c 


-2.196 


1.00 


0.00 


c 


-1.903 


1.00 


0.00 


o 


-4.048 


LOO 


0.00 


c 


-5.513 


1.00 


0.00 


c 


-5.942 


1.00 


0.00 


c 


-6.459 


1.00 


0.00 


c 


-7.289 


1.00 


0.00 


c 


-7.807 


1.00 


0.00 


c 


-8.222 


1.00 


0.00 


c 


-2.633 


1.00 


0.00 


H 


-4.042 


1.00 


0.00 


H 


-3.586 


1.00 


0.00 


H 


-3.950 


1.00 


0.00 


H 


-5.213 


1.00 


0.00 


H 


-6.136 


1.00 


0.00 


H 


-7.610 


1.00 


0.00 


H 


-8.535 


1.00 


0.00 


H 


-9.276 


1.00 


0.00 


H 


-1.557 


1.00 


0.00 


N 


-0.448 


1.00 


0.00 


C 


-0.921 


1.00 


0.00 


C 


-1.107 


1.00 


0.00 


o 
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ATOM 


447 


CB 


CYS 


A 


31 


-3.354 


6.487 


0.658 


1.00 


().()() 


c 


ATOM 


448 


SG 


C>S 


A 


31 


-2.525 


7.950 


1 .359 


1 .00 


0.00 


s 


ATOM 


449 


H 


CYS 


A 


31 


-3.045 


7.312 


-1.839 


1 .00 


0.00 


H 


ATOM 


450 


HA 


CYS 


A 


31 


-1.602 


5.488 


-0.058 


1 .00 


0.00 


H 


ATOM 


451 


IHB 


CYS 


A 


31 


-4.306 


6.809 


0.261 


1 .00 


0.00 


H 


ATOM 


452 


2HB 


CYS 


A 


31 


-3.525 


5 788 


1 .463 


1.00 


0.00 


H 


ATOM 


453 


N 


TYR 


/\ 


32 


-2.581 


3.455 


-1.109 


1.00 


0.00 


N 


ATOM 


454 


CA 


TYR 


A 


32 


-3.182 


2.203 


-1.560 


1.00 


0.00 


c 


ATOM 


455 


C 


T^'R 


A 


32 


-2.630 


1.021 


-0.770 


1.00 


0.00 


c 


ATOM 


456 


O 


T^'R 


A 


32 


-1.607 


1.136 


-0.097 


1.00 


0.00 


o 


ATOM 


457 


CB 


T^'R 


A 


32 


-2.933 


1.994 


-3.056 


1.00 


0.00 


c 


ATOM 


458 


CG 


T^'R 


A 


32 


-1.568 


2.454 


-3.520 


1.00 


0.00 


c 


ATOM 


459 


CDl 


T^'R 


A 


32 


-0.413 


2.005 


-2.892 


1.00 


0.00 


c 


ATOM 


460 


CD2 


T^R 


A 


32 


-1.437 


3.336 


-4.584 


1.00 


0.00 


c 


ATOM 


461 


CEl 


TYR 


A 


32 


0.837 


2.423 


-3.314 


1.00 


0.00 


c 


ATOM 


462 


CE2 


T^'R 


A 


32 


-0.192 


3.757 


-5.01 1 


1.00 


0.00 


c 


ATOM 


463 


CZ 


T^'R 


A 


32 


0.940 


3.299 


-4.374 


1.00 


0.00 


c 


ATOM 


464 


OH 


T^'R 


A 


32 


2.182 


3.717 


-4.797 


1.00 


0.00 


o 


ATOM 


465 


H 


T^R 


A 


32 


-1.616 


3.491 


-0.942 


1.00 


0.00 


H 


ATOM 


466 


HA 


TYR 


A 


32 


-4.246 


2.268 


-1.389 


1.00 


0.00 


H 


ATOM 


467 


IHB 


T^'R 


A 


32 


-3.022 


0.942 


-3.285 


1.00 


0.00 


H 


ATOM 


468 


2HB 


T^'R 


A 


32 


-3.676 


2.542 


-3.616 


1.00 


0.00 


H 


ATOM 


469 


HDl 


T^'R 


A 


32 


-0.497 


1.318 


-2.062 


1.00 


0.00 


H 


ATOM 


470 


HD2 


TYR 


A 


32 


-2.326 


3.694 


-5.082 


1.00 


0.00 


H 


ATOM 


471 


HEl 


TYR 


A 


32 


1.723 


2.062 


-2.814 


1.00 


0.00 


H 


ATOM 


472 


HE2 


T^'R 


A 


32 


-0.1 12 


4.445 


-5.842 


1.00 


0.00 


H 


ATOM 


473 


HH 


T^'R 


A 


32 


2.193 


4.675 


-4.859 


1.00 


0.00 


H 


ATOM 


474 


N 


CYS 


A 


33 


-3.317 


-0.116 


-0.854 


1.00 


0.00 


N 


ATOM 


475 


CA 


C^'S 


A 


33 


-2.895 


-1.319 


-0.141 


1.00 


0.00 


c 


ATOM 


476 


C 


C^'S 


A 


33 


-1.444 


-1.671 


-0.460 


1.00 


0.00 


c 


ATOM 


477 


O 


CVS 


A 


,1.1 


-0.976 


-1.477 


-1.582 


1.00 


0.00 


o 


ATOM 


478 


CB 


C^'S 


A 


33 


-3.810 


-2.497 


-0.489 


1.00 


0.00 


c 


ATOM 


479 


SG 


CYS 


A 


33 


-3.186 


-4.122 


0.064 


1.00 


0.00 


s 


ATOM 


480 


H 


C^S 


A 


33 


-4.127 


-0.145 


-1.406 


1.00 


0.00 


H 


ATOM 


481 


HA 


C^'S 


A 


33 


-2.975 


-1.118 


0.918 


1.00 


0.00 


H 


ATOM 


482 


IHB 


CYS 


A 


33 


-4.773 


-2.344 


-0.029 


1.00 


0.00 


H 


ATOM 


483 


2HB 


CYS 


A 


33 


-3.933 


-2.542 


-1.562 


1.00 


0.00 


H 


ATOM 


484 


N 


ARG 


A 


34 


-0.747 


-2.197 


0.540 


1.00 


0.00 


N 
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ATOM 


48ri 


CA 


ARG 


/\ 


34 


0.647 


-2.591 


0.386 


1 .00 


().()() 


C 


ATOM 


486 


C 


ARG 


A 


34 


0.907 


-3.912 


1.103 


1 .00 


0.00 


c 


ATOM 


487 


O 


ARG 


A 


34 


0.720 


-4.018 


2.316 


1 .00 


0.00 


o 


ATOM 


488 


CB 


ARG 


/\ 


34 


1.573 


-1.506 


0.936 


1 .00 


0.00 


c 


ATOM 


489 


CO 


ARG 


A 


34 


3.050 


-1.838 


0.796 


1 .00 


0.00 


c 


ATOM 


490 


CD 


ARG 


/\ 


34 


3.891 


-0.582 


0.635 


1.00 


0.00 


c 


ATOM 


491 


NE 


ARG 


A 


34 


5.318 


-0.855 


0.790 


1.00 


0.00 


N 


ATOM 


492 


CZ 


ARG 


A 


34 


6.078 


-1.392 


-0.162 


1.00 


0.00 


c 


ATOM 


493 


NHl 


ARG 


A 


34 


5.551 


-1.715 


-1.337 


1.00 


0.00 


N 


ATOM 


494 


NH2 


ARG 


A 


34 


7.366 


-1.606 


0.062 


1.00 


0.00 


N 


ATOM 


495 


H 


ARG 


A 


34 


-1.183 


-2.327 


1.407 


1.00 


0.00 


H 


ATOM 


496 


HA 


ARG 


A 


34 


0.842 


-2.721 


-0.669 


1.00 


0.00 


H 


ATOM 


497 


IHB 


ARG 


A 


34 


1.382 


-0.584 


0.408 


1.00 


0.00 


H 


ATOM 


198 


2HB 


ARG 


A 


34 


1.357 


-1.361 


1.984 


1.00 


0.00 


H 


ATOM 


499 


IHG 


ARG 


A 


34 


3.375 


-2.365 


1 .68 1 


1.00 


0.00 


H 


ATOM 


500 


2HG 


ARG 


A 


34 


3.187 


-2.466 


-0.072 


1.00 


0.00 


H 


ATOM 


501 


IHD 


ARG 


A 


34 


3.717 


-0.172 


-0.349 


1.00 


0.00 


H 


ATOM 


502 


2HD 


ARG 


A 


34 


3.587 


0.137 


1.381 


1.00 


0.00 


H 


ATOM 


503 


HE 


ARG 


A 


34 


5.731 


-0.626 


1.648 


1.00 


0.00 


H 


ATOM 


504 


IHHl 


ARG 


A 


34 


4.580 


-1.557 


-1.513 


1.00 


0.00 


H 


ATOM 


505 


2HH1 


ARG 


A 


34 


6.128 


-2.118 


-2.048 


1.00 


0.00 


H 


ATOM 


506 


1HH2 


ARG 


A 


34 


7.767 


-1.363 


0.946 


1.00 


0.00 


H 


ATOM 


507 


2HH2 


ARG 


A 


34 


7.937 


-2.009 


-0.653 


1.00 


0.00 


H 


ATOM 


508 


N 


L^'S 


A 


35 


1.330 


-4.919 


0.345 


1.00 


0.00 


N 


ATOM 


509 


CA 


L^'S 


A 


35 


1.609 


-6.237 


0.908 


1.00 


0.00 


c 


ATOM 


510 


C 


L^'S 


A 


35 


3.021 


-6.317 


1.486 


1.00 


0.00 


c 


ATOM 


511 


O 


LYS 


A 


35 


3.507 


-7.404 


1.797 


1.00 


0.00 


o 


ATOM 


512 


CB 


L^'S 


A 


35 


1.423 


-7.316 


-0.160 


1.00 


0.00 


c 


ATOM 


513 


CG 


L^'S 


A 


35 


0.015 


-7.886 


-0.206 


1.00 


0.00 


c 


ATOM 


514 


CD 


L^'S 


A 


35 


-0.268 


-8.568 


-1.535 


1.00 


0.00 


c 


AlOM 


515 


CE 


LYS 


A 


35 


0.516 


-9.866 


1 .674 


1.00 


0.00 


c 


ATOM 


516 


NZ 


L^S 


A 


35 


-0.380 


-1 1.052 


-1.750 


1.00 


0.00 


N 


ATOM 


517 


H 


L^S 


A 


35 


1 .454 


-4.775 


-0.617 


1.00 


0.00 


H 


ATOM 


518 


HA 


S 


A 


35 


0.899 


-6.410 


1 .703 


1 .00 


0.00 


H 


ATOM 


519 


IHB 


LYS 


A 


35 


1.649 


-6.893 


-1.127 


1.00 


0.00 


H 


ATOM 


520 


2HB 


LYS 


A 


35 


2.1 10 


-8.125 


0.039 


1.00 


0.00 


H 


ATOM 


521 


IHG 


L^'S 


A 


35 


-0.098 


-8.608 


0.589 


1 .00 


0.00 


H 


ATOM 


522 


2HG 


LYS 


A 


35 


-0.694 


-7.082 


-0.067 


1 .00 


0.00 


H 



-88- 



4()7T 9S091()L S 



ATOM 


523 


IHD 


LYS 


ATOM 


524 


2HD 


l.YS 


ATOM 


525 


IHH 


LYS 


ATOM 


526 


2HE 


LYS 


ATOM 


527 


IHZ 


LYS 


ATOM 


528 


2 HZ 


LYS 


ATOM 


529 


3 HZ 


LYS 


ATOM 


530 


N 


LEU 


ATOM 


531 


CA 


LEU 


ATOM 


532 


C 


LEU 


ATOM 


533 


O 


LEU 


ATOM 


534 


CB 


LEU 


ATOM 


535 


CG 


LEU 


ATOM 


536 


GDI 


LEU 


ATOM 


537 


CD2 


LEU 


ATOM 


538 


H 


LEU 


ATOM 


539 


HA 


LEU 


ATOM 


540 


IHB 


LEU 


ATOM 


541 


2HB 


LEU 


ATOM 


542 


HG 


LEU 


ATOM 


543 


IHDl 


LEU 


ATOM 


544 


2HD1 


LEU 


ATOM 


545 


3HD1 


LEU 


ATOM 


546 


1HD2 


LEU 


ATOM 


547 


2HD2 


LEU 


ATOM 


548 


3HD2 


LEU 


ATOM 


549 


N 


GLY 


ATOM 


550 


CA 


GLY 


ATOM 


551 


C 


GLY 


ATOM 


552 


O 


GLY 


ATOM 


553 


H 


GLY 


ATOM 


554 


IHA 


GLY 


ATOM 


555 


2HA 


GLY 


ATOM 


556 


N 


THR 


ATOM 


557 


GA 


THR 


ATOM 


558 


G 


THR 


ATOM 


559 


O 


THR 


A.TOM 


560 


GB 


THR 



A 35 -1.322 -8.790 

A 35 ().()! 2 -7.902 

A 35 1.109 -9.817 

A 35 1.169 -9.973 

A 35 -0.019 -11.814 

A 35 -0.430 -11.401 

A 35 -1.340 -10.797 

A 36 3.679 -5.167 

A 36 5.025 -5.128 

A 36 5.990 -5.915 

A 36 5.918 -7.141 

A 36 5.039 -5.664 

A 36 4.725 -4.611 

A 36 3.342 -1.840 

A 36 5.783 -4.613 

A 36 3.251 -4.329 

A 36 5.339 -4.096 

A 36 4.308 -6.458 

A 36 6.017 -6.072 

A 36 4.730 -3.640 

A 36 2.594 -4.694 

A 36 3.176 -4.138 

A 36 3.276 -5.848 

A 36 5.340 -4.294 

A 36 6.581 -3.934 

A 36 6.183 -5.609 

A 37 6.890 -5.193 

A 37 7.871 -5.823 

A 37 9.003 -6.472 

A 37 10.174 -6.191 

A 37 6.888 -4.219 

A 37 7.376 -6.578 

A 37 8.284 -5.076 

A 38 8.654 -7.347 

A 38 9.649 -8.042 

A 38 10.046 -9.357 

A 38 9.603 -10.429 

A 38 9.110 -8.298 
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-1.598 


1 .00 


0.00 


H 


-2.338 


1 .00 


0.00 


H 


-2.576 


1 .00 


0.00 


H 


-0.820 


1 .00 


0.00 


H 


-1.140 


1 .00 


0.00 


H 


-2,728 


1 .00 


0.00 


H 


- 1 .436 


1.00 


0.00 


H 


1.633 


1.00 


0.00 


N 


2.175 


1.00 


0.00 


G 


1.305 


1.00 


0.00 


G 


1.223 


1.00 


0.00 


o 


3.604 


1.00 


0.00 


G 


4.663 


1.00 


0.00 


G 


5.256 


1.00 


0.00 


G 


5.756 


1.00 


0.00 


G 


1.374 


LOO 


0.00 


H 


2.190 


1.00 


0.00 


H 


3.678 


1.00 


0.00 


H 


3.807 


1.00 


0.00 


H 


4.190 


1.00 


0.00 


H 


4.490 


1.00 


0.00 


H 


6.060 


1.00 


0.00 


H 


5.638 


1.00 


0.00 


H 


6.687 


1.00 


0.00 


H 


5.486 


1.00 


0.00 


H 


5.870 


1.00 


0.00 


H 


0.658 


1.00 


0.00 


N 


-0.206 


1.00 


0.00 


G 


0.567 


1.00 


0.00 


G 


0.315 


1.00 


0.00 


o 


0.770 


1.00 


0.00 


H 


-0.800 


1.00 


0.00 


H 


-0.868 


1.00 


0.00 


H 


1.506 


1.00 


0.00 


N 


2.312 


1.00 


0.00 


G 


1 .655 


1.00 


0.00 


G 


2.070 


1.00 


0.00 


O 


3.720 


1.00 


0,00 


G 
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ATOM 


561 


OGl 


THR 


A 


38 


ATOM 


562 


CG2 


THR 


A 


38 


ATOM 


563 


H 


THR 


A 


38 


AlOM 


564 


HA 


THR 


A 


38 


ATOM 


565 


HB 


THR 


A 


38 


ATOM 


566 


HGl 


THR 


A 


38 


ATOM 


567 


1HG2 


THR 


A 


38 


ATOM 


568 


2HG2 


THR 


A 


38 


ATOM 


569 


3HG2 


THR 


A 


38 


ATOM 


570 


N 


ALA 


A 


39 


ATOM 


571 


CA 


ALA 


A 


39 


ATOM 


572 


C 


ALA 


A 


39 


ATOM 


573 


o 


ALA 


A 


39 


ATOM 


574 


CB 


ALA 


A 


39 


ATOM 


575 


H 


ALA 


A 


39 


ATOM 


576 


HA 


ALA 


A 


39 


ATOM 


577 


IHB 


ALA 


A 


39 


ATOM 


578 


2HB 


ALA 


A 


39 


ATOM 


579 


3HB 


ALA 


A 


39 


ATOM 


580 


N 


MET 


A 


40 


ATOM 


581 


CA 


MET 


A 


40 


ATOM 


582 


C 


MEl^ 


A 


40 


ATOM 


583 


O 


MET 


A 


40 


ATOM 


584 


CB 


MET 


A 


40 


ATOM 


585 


CG 


MET 


A 


40 


ATOM 


586 


SD 


MET 


A 


40 


ATOM 


587 


CE 


MET 


A 


40 


ATOM 


588 


H 


MET 


A 


40 


ATOM 


589 


HA 


MET 


A 


40 


ATOM 


590 


IHB 


MET 


A 


40 


ATOM 


591 


2HB 


MET 


A 


40 


ATOM 


592 


IHG 


MET 


A 


40 


ATOM 


593 


2HG 


MET 


A 


40 


ATOM 


594 


IHE 


MET 


A 


40 


ATOM 


595 


2HE 


MET 


A 


40 


ATOM 


596 


3 HE 


MET 


A 


40 


ATOM 


597 


N 


ASN 


A 


41 


ATOM 


598 


CA 


ASN 


A 


4! 




7.895 


-9.024 


3.669 


1.00 


0.00 


o 


8.854 


-7.029 


4.505 


1 .00 


0.00 


c 


7.707 


-7.534 


1 .658 


1 .00 


0.00 


H 


10.522 


-7.408 


2.379 


1 .00 


0.00 


H 


9.832 


-8.885 


4.269 


1.00 


0.00 


H 


7.196 


-8.459 


3.332 


1.00 


0.00 


H 


8.717 


-6.205 


3.820 


1.00 


0.00 


H 


9.698 


-6.827 


5.147 


1.00 


0.00 


H 


7.965 


-7.150 


5.106 


1.00 


0.00 


H 


10.878 


-9.267 


0.625 


1.00 


0.00 


N 


11.335 


-10.449 


-0.099 


1.00 


0.00 


c 


12.107 


-11.402 


0.810 


1.00 


0.00 


c 


12.308 


-12.568 


0.469 


1.00 


0.00 


o 


12.191 


-10.040 


-1.288 


1.00 


0.00 


c 


11.190 


-8.381 


0.342 


1.00 


0.00 


H 


10.464 


-10.959 


-0.474 


1.00 


0.00 


H 


11.591 


-9.475 


-1.985 


1.00 


0.00 


H 


12.577 


-10.922 


-1.775 


1.00 


0.00 


H 


13.014 


-9.429 


-0.946 


1.00 


0.00 


H 


12.536 


-10.904 


1.963 


1.00 


0.00 


N 


13.283 


-11.718 


2.914 


1.00 


0.00 


c 


12.334 


-12.525 


3.796 


1.00 


0.00 


c 


12.618 


-13.671 


4.143 


1.00 


0.00 


o 


14.178 


-10.833 


3.783 


1.00 


0.00 


c 


15.537 


-11.448 


4.081 


1.00 


0.00 


c 


16.877 


-10.242 


4.016 


1.00 


0.00 


s 


16.815 


-9.759 


2.292 


1.00 


0.00 


c 


12.347 


-9.970 


2.179 


1.00 


0.00 


H 


13.902 


-12.400 


2.352 


1.00 


0.00 


H 


14.336 


-9.892 


3.277 


1.00 


0.00 


H 


13.679 


-10.645 


4.722 


1.00 


0.00 


H 


15.512 


-11.882 


5.069 


1.00 


0.00 


H 


15.733 


-12.222 


3.353 


1.00 


0.00 


H 


16.197 


-8.880 


2.186 


1.00 


0.00 


H 


16.398 


-10.566 


1.708 


1.00 


0.00 


H 


17.814 


-9.541 


1 .943 


1.00 


0.00 


H 


1 1.206 


-11.920 


4.149 


1.00 


0.00 


N 


10.211 


-12.581 


4.986 


1.00 


0.00 


c 
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ATOM 599 C ASN 

ATOM 600 O ASN 

ATOM 601 CB ASN 

ATOM 602 CG ASN 

ATOM 603 ODl ASN 

ATOM 604 ND2 ASN 

ATOM 605 H ASN 

ATOM 606 HA ASN 

ATOM 607 IHB ASN 

ATOM 608 2HB ASN 

ATOM 609 1HD2 ASN 

ATOM 610 2HD2 ASN 

ATOM 611 N PRO 

ATOM 612 CA PRO 

ATOM 613 C PRO 

ATOM 614 O PRO 

ATOM 615 CB PRO 

ATOM 616 CG PRO 

ATOM 617 CD PRO 

ATOM 618 HA PRO 

ATOM 619 IHB PRO 

ATOM 620 2HB PRO 

ATOM 621 IHG PRO 

ATOM 622 2HG PRO 

ATOM 623 IHD PRO 

ATOM 624 2HD PRO 

ATOM 625 N CYS 

ATOM 626 CA C^'S 

ATOM 627 C C^'S 

ATOM 628 O C^'S 

ATOM 629 CB C^ S 

ATOM 630 SG C^'S 

ATOM 631 H C^'S 
ATOM 632 HA C^'S 
ATOM 633 IHB CYS 
ATOM 634 2HB CYS 
ATOM 635 N SER 
ATOM 636 CA SER 




A 


41 


8.806 


-12.072 


4.664 


A 


41 


8.174 


-11 402 


5.481 


A 


41 


10.529 


-12.356 


6.467 


A 


41 


11. 147 


-13.578 


7.1 17 


A 


41 


1 1 .058 


-14.688 


6.591 


A 


4! 


11.780 


-13.382 


8.268 


A 


41 


1 1.035 


-1 1.007 


3.837 


A 


41 


10.253 


-13.639 


4.775 


A 


41 


1 1.223 


-11.534 


6.559 


A 


41 


9.618 


-12.114 


6.994 


A 


41 


11.812 


-12.471 


8.629 


A 


41 


12.189 


-14.156 


8.710 


A 


42 


8.299 


-12.384 


3.459 


A 


42 


6.966 


-11.956 


3.028 


A 


42 


5.855 


-12.784 


3.665 


A 


42 


6.027 


-13.974 


3.927 


A 


42 


7.002 


-12.182 


1.518 


A 


42 


7.954 


-13.313 


1.332 


A 


42 


8.987 


-13.177 


2.420 


A 


42 


6.800 


-10.909 


3.235 


A 


42 


6.013 


-12.435 


1.165 


A 


42 


7.351 


-11. 289 


1 .023 


A 


42 


7.431 


-14.252 


1.428 


A 


42 


8.422 


-13.243 


0.362 


A 


42 


9.265 


-14.149 


2.800 


A 


42 


9.857 


-12.655 


2.051 


A 


43 


4.715 


-12.147 


3.909 


A 


43 


3.582 


-12.799 


4.505 


A 


43 


2.788 


-13.563 


3.458 


A 


43 


3.287 


-14.505 


2.841 


A 


43 


2.717 


-1 1.741 


5.173 


A 


43 


3.620 


-10.655 


6.325 


A 


43 


4.627 


-1 1.204 


3.684 


A 


43 


3.923 


-13.485 


5.246 


A 


43 


2.274 


-11.119 


4.409 


A 


43 


1.942 


-1 2 227 


5.718 


A 


44 


1.554 


-13.151 


3.270 


A 


44 


0.670 


-13.781 


2.308 



1.00 


0.00 


c 


1 .00 


0.00 


o 


1 .00 


0.00 


c 


1.00 


0.00 




1 .00 


0.00 


o 


i OO 


0.00 


N 


1.00 


0.00 


H 


1.00 


0.00 


H 


1.00 


0.00 


H 


1.00 


0.00 


H 


1.00 


0.00 


H 


1.00 


0.00 


H 


1.00 


0.00 


N 


1.00 


0.00 


c 


1.00 


0.00 


c 


1.00 


0.00 


o 


1.00 


0.00 


c 


1.00 


0.00 


c 


1.00 


0.00 


c 


1.00 


0.00 


H 


1.00 


0.00 


H 


1.00 


0.00 


H 


1.00 


0.00 


H 


1.00 


0.00 


H 


1.00 


0.00 


H 


1.00 


0.00 


H 


1.00 


0.00 


N 


1.00 


0.00 


C 


1.00 


0.00 


C 


1.00 


0.00 


o 


1.00 


0.00 


C 



1 .00 0.00 s 

1 .00 0.00 H 

1 .00 0.00 H 

1.00 0.00 H 

1.00 0.00 H 

1.00 0.00 N 

1.00 0.00 C 
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ATOM 


637 


C 


SER 


A 


44 


0.936 


-13.263 


0.900 


1 .00 


0.00 


c 


ATOM 


638 


O 


SER 


A 


44 


0,285 


-12.326 


0.439 


1 .00 


0.00 


o 


ATOM 


639 


CB 


SER 


A 


44 


-0.791 


-13.539 


2.688 


1 .00 


0.00 


c 


ATOM 


640 


OG 


SER 


A 


44 


-1.145 


-14.269 


3.851 


1 .00 


0.00 


o 


ATOM 


641 


H 


SER 


/\ 


44 


1.231 


-12.404 


3.801 


1.00 


0.00 


H 


ATOM 


642 


HA 


SER 


/\ 


44 


0.869 


-14.835 


2.337 


1.00 


0 00 


H 


ATOM 


643 


IHB 


SER 


A 


44 


-0.943 


-12.488 


2.879 


1 .00 


0.00 


H 


ATOM 


644 


2HB 


SER 


A 


44 


-1.429 


-13.852 


1.874 


1.00 


0.00 


H 


ATOM 


645 


HQ 


SER 


A 


44 


-1.908 


-14.821 


3.663 


1.00 


0.00 


H 


ATOM 


646 


N 


ARG 


A 


45 


1.898 


-13.880 


0.219 


1.00 


0.00 


N 


ATOM 


647 


CA 


ARG 


A 


45 


2.250 


-13.481 


-1.138 


1.00 


0.00 


c 


ATOM 


648 


C 


ARG 


A 


45 


2.882 


-14.639 


-1.904 


1.00 


0.00 


c 


ATOM 


649 


O 


ARG 


A 

J. V 


45 


3.804 


-14.443 


-2.697 


1.00 


0.00 


o 


ATOM 


650 


CB 


ARG 


A 


45 


3.211 


-12.291 


-1.108 


1.00 


0.00 


c 


ATOM 


651 


CO 


ARG 


A 


45 


3.124 


-11.405 


-2.341 


1.00 


0.00 


c 


ATOM 


652 


CD 


ARG 


A 


45 


4.407 


-11.447 


-3.153 


1.00 


0.00 


c 


ATOM 


653 


NE 


ARG 


A 


45 


5.470 


-10.655 


-2.540 


1.00 


0.00 


N 


ATOM 


654 


CZ 


ARG 


A 


45 


6.607 


-10.337 


-3.156 


1.00 


0.00 


c 


ATOM 


655 


NHl 


ARG 


A 


45 


6.833 


-10.743 


-4.399 


1.00 


0.00 


N 


ATOM 


656 


NH2 


ARG 


A 


45 


7.520 


-9.611 


-2.526 


1.00 


0.00 


N 


ATOM 


657 


H 


ARG 


A 


45 


2.382 


-14.621 


0.639 


1.00 


0.00 


H 


ATOM 


658 


HA 


ARG 


A 


45 


1.343 


-13.184 


-1.643 


1.00 


0.00 


H 


ATOM 


659 


IHB 


ARG 


A 


45 


2.988 


-1 1.685 


-0.241 


1.00 


0.00 


H 


ATOM 


660 


2HB 


ARG 


A 


45 


4.222 


-12.661 


-1.027 


1.00 


0.00 


H 


ATOM 


661 


IHG 


ARG 


A 


45 


2.307 


-11.746 


-2.960 


1.00 


0.00 


H 


ATOM 


662 


2HG 


ARG 


A 


45 


2.939 


-10.387 


-2.028 


1.00 


0.00 


H 


ATOM 


663 


IHD 


ARG 


A 


45 


4.737 


-12.473 


-3.230 


I.OO 


0.00 


H 


ATOM 


664 


2HD 


ARG 


A 


45 


4.206 


- 1 1 .06 1 


-4.142 


1.00 


0.00 


H 


ATOM 


665 


HE 


ARG 


A 


45 


5.330 


-10.342 


-1.622 


1.00 


0.00 


H 


ATOM 


666 


IHHl 


ARG 


A 


45 


6.148 


-1 1.291 


-4.880 


1.00 


0.00 


H 


ATOM 


667 


2HH1 


ARG 


A 


45 


7.688 


-10.501 


-4.856 


1.00 


0.00 


H 


ATOM 


668 


1HH2 


ARG 


A 


45 


7.356 


-9.303 


-1.590 


1.00 


0.00 


H 


ATOM 


669 


2HH2 


ARG 


A 


45 


8.375 


-9.373 


-2.989 


1.00 


0.00 


H 


ATOM 


670 


N 


THR 


A 


46 


2.382 


-15.846 


-1.661 


1.00 


0.00 


N 


ATOM 


671 


CA 


THR 


A 


46 


2.899 


-17.036 


-2.328 


1.00 


0.00 


C 


ATOM 


672 


C 


THR 


A 


46 


2.368 


-17.133 


-3.754 


1.00 


0.00 


C 


ATOM 


673 


O 


THR 


A 


46 


2.909 


-17.946 


-4.533 


1.00 


0.00 


o 


ATOM 


674 


CB 


THR 


A 


46 


2.522 


-18.293 


-1.543 


1.00 


0.00 


c 
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ATOM 


675 


OGl 


THR 


A 


46 


ATOM 


676 


CG2 


THR 


A 


46 


ATOM 


677 


oxr 


THR 


A 


46 


ATOM 


678 


H 


THR 


A 


46 


ATOM 


679 


HA 


THR 


A 


46 


ATOM 


680 


HB 


THR 


AV 


46 


ATOM 


681 


HGl 


THR 


A 


46 


Pi 1 LJiVl 




1 1 1 v_J — 


THR 


A 


46 


ATOM 


683 


2HG2 


THR 


A 


46 


ATOM 


684 


3HG2 


THR 


A 


46 


TER 


685 




THR 


A 


46 



.802 


-18.127 


-0.165 


1.00 


0.00 


o 


.250 


-19.534 


-2.014 


1.00 


0.00 


c: 


.413 


-16.397 


-4.081 


1 .00 


0.00 


o 


.648 


-15.940 


-1.019 


1.00 


0.00 


H 


.976 


-16.954 


-2.363 


1.00 


0.00 


H 


.461 


-18.470 


-1.656 


1,00 


0.00 


H 


.0 1 1 


-17.825 


0.289 


1.00 


0.00 


H 


.933 


-19.271 


-2.809 


1.00 


0,00 


H 


.534 


-20.255 


-2.378 


1.00 


0.00 


H 


.803 


-19.960 


-1.191 


1.00 


0,00 


H 



Example 3 

Screening of mini-AGRPs for melanocortin Recept or activity. 

The mini-AGRP, given by the sequence: Ac- 
CVRLHESCLGQQVPCCDPAATCYCRFFNAFCYCR-NH2(SEQ ID NO:5) was 
synthesized using Fmoc solid phase chemistry (where Ac is an acetyl group and -NH2 
denotes a C-tcrminal amide). The peptide was prepared as a single chain and then partially 
purified by HPLC. The peptide was then folded under oxidizing conditions to form the 
0 disulfide bonds, and then repurified. HPLC indicated a uniform product and mass 

spectrometry demonstrated loss of 8.0 AMUs consistent with formation of four disulfide 
bonds. The folding yield was greater than 80% as determined by HPLC. 

Amino acid analysis verified the correct omposition and allowed for the 
preparation of stock solutions of known mini-AGRP concentration (to within 5%). 
5 The mini-AGRP binding was tested by examining its ability to displace 

NDP-MSH, a well established high affinity MCR agonist. To within experimental error, the 
mini-AGRP displaced NDP-MSH at each of the melanocortin receptors (MCIR. MC3R, 
MC4R and MC5R) with potency equivalent to that of AGRP(87-132) (MARP). Antagonist 
activity was then determined using a cAMP assay. With both brain receptors MC3R and 
0 MC4R, mini-AGRP exhibited potent activity as determined by its ability to suppress a- 
MSH stimulated cAMP production. Taken together, these studies demonstrate that this 
mini-AGRP is readily synthesized, folded to a uniform product, and exhibits MCR binding. 
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sclcclivity and anlagonist activity comparable to that of the AGRP C-terminal domain 
(AGRP(87-132): MARP ). 

Kxample 4 

Synthesis and Assav of Melunocortin Receptor Binding Ligand s (Peptidomimetics) 

A) Synthesis of non-peptide Compounds 

A synthesis scheme for the peptidomimetics is illustrated in Figure 4. To 
20% Pipcridine/DMF 2 x 5min treated and DMF washed Rink Amide-MBHA resin 
(Novabiochem, Switzerland), 5 equivalents of Fmoc-Aminoisobutyric acid (Fmoc-Aib) or 7 
of Bromoacetic acid (Aldrich, USA) in DMF, activated as the symmetric anhydride by 
mcubating 30 min. with Diisopropylcarbodiimide (DIC), is reacted for 20 min., then washed 
with DMF. The washed resin, in the case of Fmoc-Aib treated with 20% Piperidine/DMF 2 
X 5 min. and DMF washed, is reacted with 20-40 equivalents, concentration 2-4 M, of 
primary or secondary amine or bromide of choice in DMSO or DMF with the appropriate 
tertiary amine (note: methylamine is only available as a 2M solution in various solvents). 
This is known as the submonomer approach (Zuckermann et al., 7 Amer Chem Soc 1992). 

The process is repeated 1-2 times, until yielding a complete molecule, or 
intermediate. To the intermediate, 7 equivalents of bromoacetic acid or 5-8 equivalents of a 
carboxylic acid of choice, activated as the symmetric anhydride by incubating 30 min. with 
Diisopropylcarbodiimide (DIC) or as the assymetric anhydride, in the case of an 
enantiomeric carboxylic acid, is reacted for 0.5-2 hr., then washed with DMF. If the 
intermidiate is brominated then reacted with 20 equivalents of the thiol or 2-4 M of the 
primary or secondary amine of choice, if the thiol or amine contains one or more accessible 
amine then the carboxylic acid of choice is coupled or the amine is guanylated. If the 
intermediate has an accessible amine then one of the following may happen: I) alkylation 
by the primary or secondary bromide of choice 2) guanylation 3) a carboxylic acid of choice 
IS coupled. The reaction of amines with 3,5-Dimethylpyrazole-l-caiboxamidinc nitrate or 
O-methylisourea is well known in the art. The resulting non-peplides are cleaved, usually, 
but not limited to, by exposing the resin bound molecules to 95:5 TFA:H20. The cleavage 
from Resin is done in 95:2.5:2.5 TFA:TIS:water. 



-94- 



4077-9809 lOrS 



B) Activity Assay 

niZK 293 cells slably expressing the human mclanocortin receptors 
(liMCR^) 1-5 are gnnvn in media. Alter remc)val ivom media, cells arc washed twice with 
MEM (Life Technologies, Gailhcrsberg, MD) and then prcincubated with AGRP (87-132) 
5 in 0.5 ml MEM i).2Vc BSA Tor 30 min. belore incubation with ''"l-NDP-MSH or 125 1- 
AGRP (87-132). Concentrations of cold ligand are added from 100 mM - 1 nM. Binding 
reactions are terminated by washing cells twice with MEM 0.2% BSA. The cells are lysed 
wnth .1 N NaOH 1% Triton X, and radioactivity of the lysate is quantified with a gamma- 
analytical counter. AGRP (87-132) hMC4R ICso = 9 nM ± E7 nM. IC50 < 100 mM is 

10 considered a specific binder 

It is apparent from the above results and discussion that the subject invention 
provides an miportant new class of MCR ligands, where the ligands have broad ranging 
applicability as either MCR antagonists and agonists and therefore fmd use in a variety of 
therapeutic and related applications. 

15 It is understood that the examples and embodiments described herein are for 

illustrative purposes only and that various modifications or changes in light thereof will be 
suggested to persons skilled in the art and are to be included within the purview of this 
application and scope of the appended claims. All publications, patents, and patent 
applications cited herein are hereby incorporated by reference in their entirety for all 

20 purposes. 



